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(57) Abstract 

The present invention relates to expression vectors 
comprising nucleic acid sequences which encode an affin- 
ity ligand (e.g., an enzyme, epitope) and a modification 
recognition sequence. The vectors further comprise , at 
least one restriction site for the insertion of a nucleic acid 
sequence capable of encoding a selected polypeptide. On 
expression, the resulting construct codes for a fusion pro- 
tein comprising an affinity ligand, the selected polypep- 
tide and a modification recognition sequence. The fusion 
protein may be isolated by virtue of the affinity ligand 
and then modified. The expression vectors may further 
comprise a nucleotide sequence encoding a cleavable link- 
er, such as a thrombin or factor Xa cleavage sequence. 
The invention further relates to expression vectors con- 
taining a nucleotide sequence encoding a gene for a se- 
lected polypeptide and capable of directing the expression 
of the selected polypeptide as a fusion protein. In addi- 
tion, methods of producing a modified fusion protein are 
disclosed. Modified or labeled fusion proteins of the pres- 
ent invention are useful in a variety of therapeutic, diag- 
nostic and research applications. 
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PLASMIDS FOR THE RAPID PREPARATION OF 
MODIFIED PROTEINS 

Description 



Background 

05 Current methods for the production of labeled 

proteins include iodination, biotinylation, or the in 
vivo labeling of protein. In vivo labeling methods 
require large amounts of radioactivity are required, 
and the protein of interest must be separated from 

10 other labeled cellular proteins. The biotiny lation "" 
and iodination procedures also require a purification 
step for each individual protein and, in some cases, 
reaction conditions which can cause inactivation of 
the protein. In addition, using these methods, 

13 modification of a protein may occur at a variety of 
sites, leading to distortions in structure or • 
biological activity. 

For example, iodination protocols relying on 
chloramine T or iodogen result in modifications at 

20 tyrosine residues and some histidine residues. 

Over-substitution and. oxidation damage may result. 
Labeling procedures which use the Bolton-Hunter 
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reagent result in the modification of free amino 
groups on lysine residues. m some proteins this 
particular modification may also have a deleterious 
effect on structure or activity. Although the 
lactoperoxidase method for iodination employs gentler 
conditions, this method also leads to modification of 
tyrosine and histidine residues, with the potential 
for structural distortion and loss of activity. 
^ Similarly, biotinylation protocols are 

frequently performed using a succinimide ester of 
bxotin. The biotin is coupled to the protein through 
free ammo groups, typically on lysine residues 
Again, modification at one or more positions may 
alter structure and/or function of the protein. In 
addition, extensive dialysis is needed to remove 
uncoupled biotin, which may be deleterious to the 
protein. 

Summary of th* T. nve ntion 

The present invention relates to expression 
vectors comprising nucleic acid sequences which 
encode an affinity ligand (e.g., an enzyme, epitope) 
and a modification recognition sequence. The vectors 
further comprise at least one restriction site for 
th. insertion of a nucleic acid sequence capable of 
encoding a selected polypeptide in frame with the 
affinity ligand and modification sequence. On 
expression, the resulting construct codes for a 
fusion protein comprising an affinity ligand, the 
selected polypeptide and a modification recognition 
30 sequence. 
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The fusion protein may be isolated by virtue of the 
affinity ligand and then modified. 

The expression vectors may further comprise a 
nucleotide sequence encoding a cleavable linker, such 
as a thrombin or factor Xa cleavage sequence. The 
sequence encoding the cleavable linker is located 
between the affinity ligand and the restriction site 
for insertion of the sequences encoding the selected 
polypeptide. In this location, the linker may be 
cleaved to release the protein of interest . following 
modification. The invention further relates to 
expression vectors containing a nucleotide sequence 
encoding a gene for a selected polypeptide and 
capable of directing the expression of the selected 
polypeptide as a fusion protein. 

The pGEX-2TK expression vector is one embodiment 
of the present invention. This vector encodes a 
protein comprising, from amino to carboxyl terminus, 
glutathione-S-transf erase (GST) as an affinity 
ligand, the thrombin cleavage site as a cleavable — 
linker, and a phosphorylation recognition site for 
the cAMP-dependent protein kinase as a modification 
recognition sequence. A multiple cloning site 
comprising three restriction sites is located 
downstream of the sequence encoding the 
phosphorylation site. Thus, a nucleic acid sequence 
encoding a selected polypeptide may be inserted into 
the vector using one or more of these sites. The 
selected polypeptide is expressed as a GTK-fusion 
protein (G, GST; T, thrombin; K, kinase). 
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In addition, methods of producing a modified 
fusion protein are disclosed. Upon expression in a 
suitable host cell, the protein of interest is 
Produced as a fusion protein. The fusion protein can 
be captured on a suitable affinity matrix by virtue 
of an affinity ligand, which interacts reversibly 
with the matrix. Modification of the fusion protein 
can be carried out while the protein is attached to 
the matrix. Subsequently, the modified fusion 
protein may be isolated for use by releasing the 
fusion protein from the affinity matrix with a 
stable agent. In the case where a cleavable linker 
is present, the fusion protein may be cleaved in 
^ vitro to free the modified polypeptide portion, and 
the aff^ity ligand portion or any uncleaved product 
can be removed by adsorption on the appropriate 
affinity matrix. Alternatively, the modified 
Polypeptide portion can be released from the affinity 
2q ligand portion by cleaving the modified fusion 

Protein at the cleavable linker while still bound to 
the column. 

Modified proteins of the present invention are 
useful ma variety of applications. Labeled (e a 
^ radiolabeled) proteins may be used as molecular ' ' 
Probes. For example, antibodies can be labeled for 
therapeutic, diagnostic (e.g., Paging) or research 
purposes. Proteins may be labeled and used as 
reagents to guantitate or identify an interacting 
protein, such as a receptor. 
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Brief Description of the Drawings 

Figure 1 shows a portion of the nucleotide 
sequence around the kinase recognition site of 
expression vector pGEX-2TK. The portion of the 
PGEX-2TK sequence introduced by the synthetic duplex 
is indicated by italics. The sequences which encode 
the linker cleavable by thrombin (Leu-Val-Pro-Arg- 
Gly-Ser) , the downstream kinase recognition site 
(Arg-Arg-Ala-Ser-Val) and multiple cloning site with 
BamHI, Smal and EcoRI sites, are shown. The arrow 
indicates the point of thrombin cleavage. 

Figure 2 is an illustration of the structure of 
expression vector pAR (A RI) 59/60. The general 
structure of the plasmid is shown at top. bla , 
indicates the ^-lactamase gene which confers 
ampicillin resistance; ori, indicates the origin of 
replication. In the center panel, the general 
structure of FEK-fusion proteins is illustrated. The 
FLAG peptide portion, comprising the FLAG epitope and 
enterokinase cleavable linker, is indicat-ed. "HMK" 
indicates the location of. the phophorylation site. 
"Protein" indicates the location of the selected 
polypeptide. The lower panel shows a more detailed 
illustration of the N-terminal region of the vector. 
The peptide sequence shown (Met-Asp-Tyr-Lys-Asp-Asp- 
-Asp-Asp-Lys-Ala-Arg-Arg-Ala-Ser-Val-Glu-Phe-) in the 
detail is a contiguous sequence. The extent of the 
FLAG peptide, the enterokinase cleavage site, and HMK 
recognition (phosphorylation site) are indicated. 

Figure 3 is a bar graph showing the effect on 
P incorporation in vitro of an HMK sequence on 
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different fusion proteins, comprising portions of the 
retinoblastoma susceptibitliy gene product, RB. 
Proteins were phosphorylated in vitro using 
cAMP-dependent protein kinase. The hatched bars 
indicate that the fusion protein (GST-RB379-792 or 
GST-RB379-792;pm706) was expressed from pGEX-2TK and 
contained an HMK sequence, while the solid bars 
indicate that the fusion protein (GST-RB379-792 or 
CST-RB379-792;p.706) was expressed from p GE x- 2T and 
did not contain an HMK sequence. 

Unless indicated otherwise, the orientation of 
particular amino acid sequences is such that the 
ammo end is on the left and the carboxyl end is on 
the right. 

15 g eta iled Description af ^h e invention 

The present invention relates to expression 
vectors comprising nucleic acid sequences which 
encode an affinity ligand and a modification 

20 a r ! C ° 9nition Se * uence - ™. vectors further comprise 
east one restriction site- for the insertion of a 
nucleic acid sequence capable of encoding a selected 
polypeptide. The expression vectors may further 
comprise a nucleotide sequence encoding a cleavable 
linker sequence. These vectors are referred to as 

° parent vectors. 

The present invention further relates to 
expression vectors which are derived from the parent 
vectors described above, by the insertion of a 
o nucleic acid sequence capable of encoding a selected 
polypeptide (e.g., a natural or synthetic cDNA or 
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genoinic DNA which encodes the selected polypeptide 
and is capable of being expressed in the appropriate 
host cell) into the parent vector. The parent 
vector can be cleaved at one or more restriction 
05 sites in order to insert sequences encoding a 

selected polypeptide using known techniques. Linkers 
or other sequences can be used to facilitate 
insertion. Insertion at an appropriate restriction 
site or site in the parent vector results in the in 
10 frame fusion of the sequences of the selected 
polypeptide with the amino acid sequences for an 
affinity ligand, modification recognition sequence 
and, if present, the optional cleavable linker 
encoded by the parent vector. . On expression, the 
15 resulting fusion gene encoded by the vector codes for 
a fusion protein comprising an affinity ligand, a 
modification recognition sequence, selected 
polypeptide, and optionally, a cleavable linker. The 
location of the restriction site or sites selected 
for insertion in the vector will determine the 
location of the selected polypeptide relative to the 
affinity ligand, modification recognition sequence 
and optional cleavable linker in the encoded fusion 
^ protein. The selected polypeptide element of the 
fusion protein comprises at least one peptide, 
polypeptide or protein of interest. 

The fusion protein comprising an affinity 
ligand, a modification recognition sequence, an 
^ optional cleavable linker and a selected polypeptide 
forms a contiguous polypeptide chain. The order of 
these components in the fusion gene and protein can 
vary. The location of these components or additional 
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components in th . fusion gene ^ 

IlT lneCi by °" ° rder in * ieh "»cl.ic acid 

05 nro t P " ent V6Ct0r - eXam " le ' f --" 

ITT- £ ° UOWing StrU « U " "» - -de 

the modification recognition sequence, c indicates 
the optionai cieavable iinfcer. and x indicates the 
polypeptide of interest an* «.», „ 
10 °f the fu «.„ • St ' and the ""terminal portion 

cne fusion is on the left: 

A-(C)-M-X 
A-M-(C)-X 
X-(C)-M-A 
X-M-(C)-A 

15 

M-A-(C)-X 

HoweTrTe'"! 0 " 5 " ^ "« 

hetween the aL ' ^ iS *™-»«y iocated 

- el theTsL pTteT 

::::::;°d f : seiected ---- "--t — 

simplified by a terminal location for thP m 

(X»; This nation can aiso rai nL 2 e ^T* 

» a tTTr T bi ° 109iCal ^ C... hindin 

x-«-cT x r t co " ponents can be <••••. 

can be i eXi "" Ple ' BUl " Ple "">«"<-tion sites 

can be incorporated to increase the intensity of 
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labeling. These sites may be contiguous or 
noncontiguous in the encoded fusion protein. 
Furthermore, additional sequences (S) can be present 
in the fusion protein. For instance, a sequence 

05 encoding a signal peptide or leader peptide may be 

incorporated into the vector, and the signal peptide 
will be produced as part of the fusion protein. In 
the case of a signal peptide, the additional sequence 
is preferably incorporated at the N-terminus (e.g., 

10 S-X-M-OA) However, for other sequences, an 

internal or C-terminal location in the fusion protein 
may be desired. 

Note that on expression in a suitable host cell, 
the parent vector may also produce the affinity 

15 ligand, modification sequence, and optional cleavable 
linker sequences as a fusion protein. This product 
may contain additional sequences encoded by the 
vector, depending on the location of a promoter or 
termination signals relative to the coding sequences; 

20 for these elements. In some cases, restriction sites 
for insertion of sequences encoding a selected 
polypeptide may disrupt the reading frame of . 
components encoded by the parent vector. Insertion 
of sequences encoding a selected polypeptide will 

23 restore the reading frame. 

The expression vectors of the present invention 
can be designed for use in a variety of host cells, 
including bacterial host cells such as E. coli and 
eukaryotic host cells (e.g., yeast cells and 

30 mammalian cells). Thus, fusion proteins can be 

glycosylated. Like other expression vectors, in the 
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15 



expression vectors of the present invention, a 
Promoter is provided for expression of the fusion 
Protexn in a suitable host cell. Suitable promoters 
05 the C ° nStitUtiVe ° r -ducible. In the vectors , 
the propter is operably linked to nucleic acid 
seances encoding the fusion protein and is capable 
of dxrectxng the expression of the corresponding 

Polypeptide. A variety of suitable promoters for 
pro Caryotic (e . g>< lac promoter( 

ZTuTsl h ° StS (e - 9 -' yeaSt 3lC0h01 ^nydrogenase 
(ADHl ) , SV40) are available. 

In addition, the expression vectors typically 
=- P r a se a selectable marker tor selection of best 
cells carrying the plasmid and an origin of 

vector. Genes encoding products which confer 

TnT^u »• ~ selectable markers 

and b y be used in prokaryotic (e.g.. .-lactamase for 
» «"«ance. tetracycline resistance, and 

eukaryotic cells (e.g., 0418 , . Ge nes e „ cod 
gene product of auxotrophic barkers (e.g., LEU2 and 

ara C °"°"° nl y « selectable barkers in 

yeast, use of viral or phage vectors, and vectors 
which are capable of integrating into the genome of 
.he host cell, such as retroviral vectors, are also 
on ebplated. T he present invention also relate to 
cells carrying these types of expression vectors 
( e -9., transformed cells). 
3o ^e incorporation of one or more modification 

"cognition sequences into the fusion protein a!lo„s 
the production of modified fusion proteins. A s shown 
m the Examples, incorporation of a detectable label 
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(e.g., radioactive, fluorescent) at the modification 
site provides a convenient way to label the fusion 
protein. Thus, the invention further relates to the 
fusion proteins and modified (e.g., labeled) fusion 

05 proteins produced from expression vectors of the 
present invention. It is also possible to produce 
fusion proteins of the present invention using 
methods of peptide synthesis, or in vitro 
transcription/translation procedures . Modification 

10 of synthetic fusion proteins is also possible. 

The affinity ligand present in the fusion 
proteins of the present invention is a polypeptide 
encoding an affinity ligand. The affinity ligand 
(i.e., an affinity ligand or portion thereof) is a 

15 member of a specific binding pair and is capable of 
binding to a specific binding partner. Preferably, 
the binding of the affinity ligand to its specific 
binding partner is reversible, allowing recovery of 
fusion proteins comprising the affinity ligand where 

20 desired. Examples of affinity ligands include, but 
are not necessarily limited to, antibodies or 
portions thereof, antigens (e.g., influenza 
hemagglutinin) or epitopes (e.g., FLAG epitope), 
enzymes (e.g., glutathione-S-transf erase (GST), 

25 /9-galactosidase ( lacZ ) , the trpE product) , hormones, 
growth factors or other proteins capable binding to a 
specific binding partner (e.g., maltose binding 
protein, histidine hexamer (a heavy metal binding 
element) , and protein A) . A specific affinity ligand 

30 is an affinity ligand comprising that protein or 
peptide or a portion thereof. Thus, a 
glutathione-S-transf erase affinity ligand is an 



SUBSTITUTE SHEET 



WO 93/03157 



PCI7US92/06187 



-12- 



" 1Wty Ugand ""Poising giutathione-s-transferase 
° a .P="i°n thereof. „ hich is capafcle 

• -s« ate| . The 

05 useful ' ' P0 " i0n " fUSi °" """" ^ 

-l.ty for attachment to a support or to facilitate^ 
purification or identification. 

For example, the fusion proteins can be 
captured on an appropriate affinity matrix. An 
affinity matrix is a solid support to which is 

partner T^""" • «P~«le binuing 

partner. Fusion proteins of the present invention 
comprising an affinity ligand , ^ ^ 

• ^ding partner via the affinity ligand part of the 
fusion protein. In the case of an affinity ligand 

a " a " ti9en - a " « P=«ion thereof 

can be used as a specific bindin, partner in an 
affinity matrix. Alternatively, an antigen or hapten 

» : y c ; :ir rated int ° a - a " ini ^ 

an Iff °" Pr ° teinS ■» antibody as 

an affinity li gana . a number of affinity 

Ugand/affinity matrix pairs are available 
Glutathione-s-transferase fusion proteins may be 
captured on immobilized glutathione as an affinity 
*atr Ulth glutathione as ^^^^ m y 

P«tner. For example, glutathione sepharose 
(Pharmacia, or glutathione agarose beads (Sigma 
Che^cal corp., can be used. Fusion proteins 

30 pep"tioe S1 " 9 " Pr0 " in *' " altOSe bindi "' FLAG 

b ^ camturel " T^"' """^ ^ ~ 

be captured on Zgc Sepharose 6FF (Pharmacia, , anylose 
resin (New E„ 9 i and Biolabs) _ ^ ^ ^ 
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anti-FLAG M2 antibody affinity resins (International 
Biotechnologies, Inc.), or a Ni 2+ affinity resin (NTA 
resin, Qiagen) , respectively. In the foregoing, IgG, 
amy lose, anti-FLAG antibodies, or a heavy metal are, 
respectively, the specific binding partners. 

When desired, a fusion protein or modified (e.g. 
labeled) fusion protein bound to an affinity matrix 
can be released by contacting the affinity matrix 
with bound fusion protein thereto with a suitable 
elution buffer comprising one or more release 
components. The release component or components can 
be molecules which compete with the fusion protein 
for binding to the affinity matrix (e.g., hapten, 
free peptide epitopes, substrate or substrate 
analogs) , or which can disrupt binding of the 
affinity ligand to the specific binding partner. For 
example, an elution buffer (a buffered solution) 
suitable for releasing fusion proteins comprising a 
glutathione-S-transf erase affinity ligand can be 
formulated comprising reduced glutathione as a 
release component. In one embodiment, an elution 
buffer comprising reduced glutathione, Tris and NaCl 
is used. 

Alternatively, the elution buffer may comprise a 
buffered solution which lacks a specific component 
required for binding. For example, a fusion protein 
with an ompA signal sequence followed by a FLAG 
affinity ligand at the amino terminus can be 
expressed in £■ coli . Specific removal of the ompA 
signal sequence upon secretion into the periplasmic 
space results in a fusion protein with an 
N-terminally located FLAG affinity ligand, which is 
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20 



30 



capable of binding to the anti-FLAG M1 antibody in 
the p of calc . m Th . s ^ £usion 

«n be eluted fro* an anti-F«G m ant ibody affinity 

03 a o^r 9 an eiution tuf£er 

-pecifio oonponent required for binding 

oan forth eXP " SSi0n ""^ °' ■* in «»«on 

oan further oonpnee a seguenoe enooding a oleavable 

suoh a'c/ " UCle ° tide "^•»« capable of enooding 

"> elreLr """ * in ™«ed -to an 

expression veotor using known teohnigues (e g 
reoo nbinant MA and/or Mtagenes , s) a ciMvaWe 

oleaveVo " °* P ° lyPeptide «P»W of being 

thro.bxn cleavage (e.g., Leu-Val-Pro-Arg-Gly-Se r) 
fetor Xa cleavage site (e.g., Ile- G1 u-Glv-Lg> or 
antero ki „ nsa cleavage ^ (> g * Arg, or 

Asp-Asp-Lys, ma y be used as a oleavable linker The 
appro iat . protease (thrombi ^ ^ £r. The 

b^nase, respectively, oan be used to oleava a fusion 

c p :::::;:r ai " in ' a — — — ; in - 

a e eav b V* 0 *'" can ocour within 

oleavable l lnlC er or at the border of the linker and 
another opponent o, a fusion protein 
The product of t „. eleava9e react . 

.acted £usion protein conprlsing 

the affinity ugand is referred to as the affinitv 
ligand portion „, .. affinity 

' , portl °"' "* the produot of the oleavage 

referred ^ **WPtM. is 

referred to as the seleoted polypeptide portion of 
the fusion p rot ei„. These portions M y comprise 
other parts of the enooded f usio „ proteln { > 
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fusion protein encoded by the expression vector) , and 
such are also fusion proteins. For example, the 
selected polypeptide portion can include the 
modification recognition sequence. If so, and if the 
05 fusion protein has been modified (e.g., labeled), 

then the selected polypeptide portion is referred to 
as the labeled selected polypeptide portion. Thus, 
the term fusion protein refers to a protein, 
polypeptide, or glycoprotein comprising at least two 
10 of the components selected from the group consisting 
of an affinity ligand, modification sequence, 
cleavable linker sequence, selected polypeptide or 
additional sequence. 

Preferably, a cleavable linker is selected which 
15 does not cleave elsewhere within the fusion protein 
(e.g., in the selected polypeptide, affinity ligand 
or modification site). In addition, the sequences 
encoding the cleavable linker are preferably located 
between those sequences encoding the affinity ligand 
20 and selected polypeptide. In this location, the 

encoded fusion protein can be cleaved to separate the 
affinity ligand portion of the fusion protein from 
the selected polypeptide portion where desired. 
Similarly, where recovery of a labeled selected 
25 polypeptide portion is desired, the sequences 

encoding the cleavable linker will be on located in 
the vector either upstream or downstream of the 
sequences encoding both the modification recognition 
sequence and the selected polypeptide (e.g., as in 
30 a-C-M-X or X-M-C-A fusions) . 

Overlap of the sequences of portions of the 
fusion protein may occur provided the respective 
portions retain function. For example, in one 
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embodiment of the present invention, pAR( A Ri) 59/60 , 
discussed in more detail below, the affinity ligand 
(FLAG epatope) sequence and cleavable linker sequence 
(enterokinase site) overlap by one amino acid. The 
sequence of the FLAG epitope from N- to C-terminus is 
(Asp-Tyr-Lys-Asp) , while the enterokinase cleavage 
site in this embodiment is (Asp-Asp-Asp-Asp-Lys) . 
Within pARURi, 59/60, these sequences overlap to give 
the flag peptide, an octapeptide of the sequence 
Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which retains the 
functions of the affinity ligand (e.g., binding to a 
specific binding partner) and cleavable linker (e.g 
cleavage by enterokinase) . Similarly, other 
components of the fusion protein can overlap 
providing their function is preserved \ 

A modification recognition sequence is also pro- 
vided. This sequence, incorporated into the fusion 
Protein, directs modification of the fusion protein. 
Modification recognition sequences can be 
incorporated into a fusion protein comprising a 
selected polypeptide which either is naturally 
modified or is not naturally modified A 
-edification recognition sequence for phosphorylation 
_ (i.e., a phosphorylation site) can be introduced into 
a vector and expressed as part of a fusion protein 
for example, m the presence of a suitable protein 
kinase, the fusion protein comprising the 
Phosphorylation (kinase) site will be 
Other types of modifications directed by the presence 
of a peptide or polypeptide sequence are envisioned 
m the present invention. For example, glycosylate 
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reactions can be directed by a short peptide 
sequence. Similarly, fatty acylation of some 
proteins is directed by a short peptide sequence 
(Cys-Ala-Ala-X) . The identification of additional 
recognition sites and the identification and 
purification of the corresponding modification enzyme 
or enzyme complex will provide alternative protocols 
for modification of the fusion proteins. 

Modification of the fusion proteins is carried 
out using a modification enzyme or enzyme complex or 
other suitable means. In the case of a 
phosphorylation reaction, a suitable phosphorylation 
method is used. Typically, the phosphorylation of 
proteins is carried out by a protein kinase. Many 
such kinases have been described and have utility in 
the present invention (see e.g., Kemp, B.E., et_al. , 
J. Biol. Chem.. 252 : 4888-4894 (1977); Edelman, A.M. 
et_al. , Ann. Rev. Biochem . 56: 567-613 (1987); Glass, 
D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 
20: 363-388 (1980); Hunter, T. and J. A. Cooper, Ann." 
Rev. Biochem. 54: 897-930 (1985); Hunter. T. , Cell 
50: 823-829 (1987)). Known protein kinases catalyze 
the transfer of the 7-phosphate group of ATP to the 
hydroxyl groups of serine and/or threonine residues 
on specific substrates or alternatively, catalyze the 
phosphorylation of tyrosine residues. Use of protein 
kinases of both specifities is contemplated in the 
present invention. Serine/threonine kinases include 
cyclic AMP (cAMP) dependent and cyclic GMP (cGMP) 
dependent protein kinases, and cyclic nucleotide- 
independent protein 
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kinases . 



A variety of serine/ threonine kinases have 
been identified including glycogen synthase kinase, 
phosphorylase kinase, casein kinase , oasein kinase 

05 It PyrUVate """J*""""' Protein kinase c 

and nyosm light chain kinase. 

Amino acid sequences present in natural 
substrates and artificial peptide substrates which 
are sufficient for activity as a protein kinase 
substrate have been identified ( see e.g., E del„an, 
*•"■ «S_S1-. Ann. Rev. Bioche,. 56 : 567 . 613 
««.. »... and E.G. Krebs, Ann. Rev. Ph^.L 
22-=£l. 20: Hunter , ~ ^ ~ 

C °°P". Ann. Hey. Binrhg . 54 : 897 . 930 

rno acid sequences, in addition to those described 

specmcai! Min0 ^~ " hi = h -» 

specifically phosphorylated, can be used as 

"edification recognition seguences in vectors of the 
cap le 7T°"- * «««-» kinase 

to oh T recognition sequence, can be used ' 

invent fUSi ° n Pr ° teinS ° f "» 

invention, ror e^ple, the cAHP-dependent protein 

kinase (e.g., the catalytic subunit of the 
^ cAKP-dependent protein kinase fro* bovine heart 

auscle) can recogni.e the consensus apino acid 

sequence Arg-Aro-Xaa-^r-v,- 

acid in Ser-xaa, where Xaa is an amino 

add ln a varxety of substrates, resulting in 
phos Ph o rylation of ^ As shown ^ 

30 pel °: vect ; rs of the present inventi ° n 

PGEX 2TK derxvatxves, or pAR URIJ 59/60 derivatives) 
incorporating a seguence which encodes a version o ' 
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the cAMP-dependent protein kinase recognition 
sequence (Arg-Arg-Ala-Ser-Val) , can direct the 
production of fusion proteins capable of being 
phosphorylated by cAMP-dependent protein kinase from 
bovine heart muscle. The Arg-Arg-Ala-Ser-Val 
sequence is also referred to as the HMK site (HMK, 
heart muscle kinase) . Other specific amino acid 
sequences, such as Arg-Arg-Ala-Ser-Leu (Li et al . , 
Proc. Natl. Sci. USA 86: 558-562 (1989) and the 
peptide Arg-Thr-Lys-Arg-Ser-Gly-Ser-Val , can be 
recognized and phosphorylated by this kinase. The 
phosphorylation sites can function at a terminal or 
internal location within a fusion protein. 

cGMP protein kinases have a substrate 
specificity that is similar, but not identical, to 
that of the cAMP-dependent protein kinases. 
Comparative analysis of substrates has been made 
(Edelman, A.M. et al . , Ann. Rev. Biochem . 56: 
567-613, (1987); Glass, D.B. and E.G. Krebs, Ann. 
Pev. Pharmacol. Toxicol. 20: 363-388 (1980); Glass, 
D.B. and E.G. Krebs, J. Biol. Chem. 254 : 9728-9738 
(1979)). The substrate specificities of the two 
types of casein kinases has also been studied, and 
the activity of peptide substrates has been compared 
(Marin, O. et al . , Eur. J. Biochem. 160 : 239-244 
(1986); Sommercorn, J. and E.G. Krebs, J. Biol. Chem . 
262: 3839-3843 (1987); Kuenzel, E. A. et al ., J. Biol. 
Chem. 262: 9136-9140 (1987)). For example, casein 
kinase II was shown to phosphorylate the synthetic 
peptide Ser-Glu-Glu-Glu-Glu-Glu. Additional peptide 
substrates were phosphorylated by casein kinase II 
(e.g., in decreasing order of activity, 
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^-Arg-Arg-Asp-Asp-Asp-ser-Asp-Asp-Asp 

oL Th lu_Glu " Glu; »*«-**»-**»-««-«»• 

"*- tlM1M1 " • «-tr.f specificity of 

tyrosine kinases has also been stud . ed (Hunt ^ t 

^ and J. a. cooper, Ann^ Rev. B iochm . 5,. 897 . 930 ' 
U985)). candidate phosphorylation sites 
(recognition sequences, can be incorporated into 
synthetic peptides or into fusion proteins and 
assayed for activity as protein kinase substrates, 
"sing techniques similar to those described 
previously. Because corresponding phosphatases 

possible"^"' rem ° Val ° f Ph ° SPhate ^ ^ 

The ability to modify the fusion proteins 
provides a convenient method of specifically labeling 
the fusion proteins with detectable radioactive or 
non radioactive labels to produce a labeled fusion 
Prctem (e.g., a [ 32 P, -labeled fusion protein,. 
Cleavage of fusion proteins can produce a selected 
2o Polypeptide portion comprising a .edification 

sequence or an affinity ligand portion comprising a 

Hi IT Unker ln relati °" to these components. 

Thus cleavage following labeling can produce a 

2j labeled selected polypeptide portion or a iabeled 
"finrty ligand portion, each of which is itself a 
fusron protein. Because the modification recognition 
sequence directs modification to a specific 
the protein, there is a nigh degree of control over 
location and extent o, modification. As shown in 
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the Examples, fusion proteins expressed and modified 
by the introduction of a detectable label retain both 
structure and function. 

The detectable label (e.g., radioactive, 
fluorescent or chemiluminescent) selected will 
determine by the nature of modification, the 
modification enzyme, and the intended use. The label 
will be incorporated into a moiety which is 
transferred to the fusion protein substrate. For 
phosphorylation reactions, [ 7 -labeled] ATP is used as 
phosphate donor (i.e., modification donor). As 
protein kinases transfer the 7-phosphate onto the 
substrate during the phosphorylation reaction, the 
label will be incorporated into the moiety 
transferred by the protein kinase. For example, to 
incorporate a radioactive phosphate label such as 

31 P, 32 P or 33 P, phosphate donors such as [7 31 P]ATP, 

32 33 
[7 P]ATP, or [7 P]ATP can be used. Alternatively, 

3 5 3 8 

isotopes of sulfur, such as S or S isotopes, can 
be incorporated as the label in the [ 7-labeled ] ATP, 
as for example in 35 S-labeled adenosine 
5' [ 7 -thio]triphosphate. The term ''phosphorylation" 
also refers to modification with such thiophosphate 
analogs or other 7-labeled analogs of ATP. Other 
considerations such as specific activity, half-life, 
type of particle emitted and energy of radiation will 
influence selection of an appropriate radioactive 
label. In addition, a fluorescent moiety or 
chemiluminescent moiety incorporated into the 
7-phosphate. Detection methods for such labels are 
well known in the art. 
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In other types of modifications (e.g., glyco- 
sylate, fatty acylation), a radioactive isotope 
chemiluminescent or fluorescent dye, or other 
nonradioactive label, could be incorporated into the 
verification donor so that it is transferred to the 
~us 10 n protein during the modification reaction. For 
example, a biotin adduct of a modification donor 
could be linked to a fusion protein. In the present 
aethod, the site of addition of the biotin label is 
controlled by the location of the modification 
recognition sequence. 

Specific Embodiment* 

In one embodiment of the present invention, 
glutathione-s-transferase (GST) from the parasitic 
helmmth S chistosoma iajspnica (Mr=26,ooo) is selected 
as the affinity ligand. However, GST genes from 
other sources, such as other bacteria or mammalian 
organisms, can be used. Xn addition, a portion of -a 

96,16 enC ° din 9 3 of GST capable of binding 

a specific binding partner, such as the substrate 
glutathione, can be used. m particular, expression 
vectors capable of expressing s. japonicum GST-fusion 
proteins (i.e., a fusion protein comprising a GST 
affinity ligand) were constructed. The parent' vector 

J PGEX_2TK ' V6Ct0rS d6riVed fr °™ P«»-2T* 

by the insertion of a nucleotide sequence encoding a 

^"^ Polypeptide, are referred to with the prefix 
PGTK-. The construction of pGEX-2TK and pGTK- 
plasmids is described in Examples l and 2 and in 
Figure l. 



WO 93/03157 



PCT/US92/06187 



-23- 

PGEX-2TK is a derivative of pGEX-2T, the latter which 
is described by D.B. Smith in EP 0,293,249, published 
November 30, 1988, PCT/AU88 / 00164 , published December 
1, 1988, and New Zealand Patent No. 224,663, issued 
November 27, 1990, and by D.B. Smith and K.S. Johnson 
in Gene 67: 31-40 (1988), The teachings of EP 
0,293,249, PCT/AU88/00164 , New Zealand Patent No. 
224,663, and Smith and K.S. Johnson ( Gene 67 : 31-40 
(1988)) are herein incorporated by reference. 

pGEX-2TK encodes a fusion protein having GST as 
an affinity ligand at the amino terminus, followed by 
a thrombin cleavage site (cleavable linker) . In 
addition, a modification recognition sequence for a 
protein kinase was incorporated downstream of the 
cleavable linker. In particular, a sequence of the 
structure Arg-Arg-Xaa-Ser-Xaa , where Xaa is an amino 
acid, was selected (Arg-Arg-Ala-Ser-Val) . This 
sequence is a phosphorylation site recognized by 
cAMP-dependent protein kinases such as the catalytic 
domain of the cAMP-dependent protein kinase from 
bovine heart muscle. In pGEX-2TK, the sequence which 
encodes the thrombin cleavage site is followed by 
several restriction sites for the insertion of 
sequences encoding a selected polypeptide. pGEX-2TK 
retains the inducible tac promoter for expression, 
three in frame stop codons, a selectable marker 
(ampicillin resistance), origin of replication, and 
laclq gene present in pGEX-2T. 

When a sequence which encodes a selected 
polypeptide is inserted in frame into the pGEX-2TK 
expression vector downstream of the phosphorylation 
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site, a GTK-fusion protein comprising the selected 
polypeptide can be produced. The GTK-fusion protein 
(where, from N- to C-terminus, G is a GST affinity 
Q5 ligand. T is a thrombin cleavage site, and K is a 

Phosphorylation site for a protein kinase) is encoded 
by a pGTK-vector, can be produced upon expression in 
a suitable E_. — coli host. These fusion proteins can 
be captured on immobilized glutathione, and labeled 
iQ by Phosphorylation with cAMP-dependent protein 

^nase, using [7 - 32 P]ATP as a modification donor. As 
shown in the Examples, several different cDNAs were 
expressed from pGEX-2TK, and labeled as described. 
The studies described in Examples 1-4 indicate that a 
selected polypeptide present in a GTK-fusion protein 
encoded by a pGTK-type vector, such as pGTK-RB 
(379-792), can retain both structure and function. 
Furthermore, the labeling (phosphorylation) procedure 
was eff icl ent and did not alter the structure or 
function of the selected polypeptides, 

in another embodiment of the present invention, 
the FLAG epitope is selected as the affinity ligand. 
Vector pAR(aRl) 59/60 , shown in Figure 2, is an 
example of this type of expression vector. Vector 
25 PARUPI, 59,60 is a derivative of bacterial expression 
Plasm.d pET3a (p a*3040) described by studier et al., 
*^_?n^ (1M0)f the teachingro7 

which are herein incorporated by reference. The 
construction of pAR(.Ri) 59/6O is described in detail 
in Example 5. m this construct, the FLAG peptide is 
30 located adjacent to the N-terminal initiator 
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methionine (Figure 2). The FLAG octapeptide 
, (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys) comprises the FLAG 
epitope (Asp-Tyr-Lys-Asp) affinity ligand and an 
overlapping enterokinase cleavable linker 
05 (Asp-Asp-Asp-Asp-Lys) . Cleavage with enterokinase 
typically occurs precisely after the terminal lysine 
(underlined above) of the FLAG octapeptide. The FLAG 
octapeptide is followed by a kinase recognition site. 
In particular, in pAR (ARI ) 59 / 60 , the site is 
10 Arg-Arg-Ala-Ser-Val (HMK) , recognized by cAMP- 

dependent protein kinase. A unique EcoRI restriction 
site located downstream of the sequence encoding the 
phosphorylation site can be used for the insertion of 
a sequence encoding a selected polypeptide. In 
15 plasmids derived from pAR (&RI ) 59/ 60 , referred to with 
the prefix pFEK- F for FLAG, E for enterokinase, and 
K for the phosphorylation recognition site) , in which 
a sequence encoding a selected polypeptide has been 
inserted, the encoded fusion protein, referred to 
20 herein as a FEK-fusion protein, has the following 
general structure for N- to C-terminus: . Met- [FLAG/ 
enterokinase cleavable linker ] -phosphorylation 
site-selected polypeptide. An alanine residue is 
located between the FLAG peptide and HMK site (see 
25 Figure 2). m addition, the EcoRI site encodes a 
Glu-Phe dipeptide. Depending on the strategy for 
insertion of a sequence encoding a selected poly- 
peptide, additional residues (e.g., encoded by a 
linker or PCR primer) may be inserted between the 
selected polypeptide and the modification recognition 
(HMK) site. In many cases, however, only 17 amino 
acids will be fused to the selected polypeptide. 
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pFEK-plasmids can direct the expression of FLAG- 
fusion proteins (i.e., fusion proteins having a FLAG 
epitope affinity ligand) from the T7 polymerase 
promoter in an appropriate host (e.g., a bacterial 
cell capable of constitutive or inducible expression 
of the T7 polymerase; for examples of suitable hosts 
and induction protocols, see Studier et al. , Meth. 
£n2 y mo1 - . US: 60-89 (1990)). The construction of 
several pFEK-plasmids is described in Example 5. The 
encoded fusion proteins were expressed in a bacterial 
host, and a lysate was prepared. Partially purified 
fractions containing each fusion protein were 
subjected to phosphorylation with the catalytic 
subunit of the cAMP-dependent protein ' kinase using 
15 [7- P]ATP as a (labeled) modification donor. The 
FEK-fusion proteins were labeled to high specific 
activity to give [ 32 P] -labeled FEK-fusion proteins 
(which are also [ 32 P J -labeled FLAG-fusion proteins). 

Note that vectors of the present invention 
related to pGTK- and pFEK- vectors can be constructed" 
lacking the cleavable linker sequence. These vectors 
would have a pGK- or P FK- prefix. A fusion protein 
comprising a selected polypeptide expressed from a 
PGK- or pFK-vector is referred to as pGK-fusion 
protein or pFK-fusion protein, respectively. 

A variety of commercially available vectors 
comprising an affinity ligand and cleavable linker 
are available- which can be modified by the 
introduction of a modification site (e.g., a 
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phosphorylation site) , and where required, one or 
more restriction sites for the in-frame insertion of 
a selected polypeptide using recombinant DNA 
techniques. For example, (1) the pGEX-3X vector 
(Pharmacia) comprising a GST affinity ligand and 
Factor Xa cleavable linker, (2) the pMAL-C vector 
(Biolabs) comprising a malB affinity ligand and 
Factor Xa cleavable linker, (3, 4) the protein A gene 
fusion expression vectors, pRIT2T and pRIT5 
(Pharmacia), comprising a protein A affinity ligand, 
(5, 6) as well as pDS and pQE-vectors (Qiagen) , 
comprising a histidine hexamer affinity ligand, could 
be modified in this way. The vector pRIT2T, vector 
PRIT5, pDS and pQE-vectors, could be further modified 
by the insertion of a sequence encoding a cleavable 
linker. Convenient affinity matrices for fusion 
proteins encoded by the foregoing were discussed 
above . 



Methods of Producing a Modified Fus ion Protein 

20 " ' 

The present invention further relates to methods 

for producing a modified fusion protein. In 
particular, a rapid and convenient method for 
labeling a fusion protein using a radiolabel or 
non-radioactive label donated by [ 7 -labeled ] ATP is 

23 provided. in addition, the methods used are mild; a 
feature which is important for preservation of the 
structure and biological activity (e.g., binding, 
antigenicity, activity) of the components of the 
fusion protein, and of the selected polypeptide in 

30 particular. 
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For example, fusion proteins of the present 
invention comprising an affinity ligand portion and a 
selected polypeptide portion are expressed in a host 
cell carrying a vector of the present invention. The 
host cells are propagated under conditions which 
permit expression of the vector. For example, 
expression from the particular pGTK- and pFEK- 
vectors described in the Examples requires induction 
by iptg. The host cells are ly Sed using known 
techniques to obtain a lysate containing the fusion 
protein. 

In one embodiment, the lysate can be crudely 
fractionated as in Example 5 and directly labeled in 
the presence of [7 -labeled] ATP (e.g., [7 - 32 P]ATP) , 
and a cAMP-dependent protein kinase, such as the 
catalytic subunit of the cAMP-dependent protein 
kinase from bovine heart muscle. A suitable 
formulation for a kinase reaction buffer and reaction 
stop buffer is given below. 

In another embodiment, the fusion protein is 
modified (e.g., phosphorylated) while bound to an 
affinity matrix. The fusion protein present in the 
lysate is captured on an affinity matrix. This step 
^ is carried out by contacting the lysate with an 

appropriate affinity matrix (i.e., an affinity matrix 
comprising a specific binding partner of the affinity 
Ugand present in the fusion protein), under 
conditions which permit binding of the affinity 
ligand portion of the fusion protein to the affinity 
matrix. Suitable conditions for a variety of 
affinity ligands and matrices are known in the art 
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For example, GST-fusion proteins (e.g., a 
GTK-fusion) can be captured on immobilized 
glutathione as an affinity matrix. As shown in 
Example 2, the GST-fusion protein can be captured on 

05 the affinity matrix in the presence of a wash buffer, 
(which was also used as the lysis buffer) . The 
components of the wash buffer permit binding of the 
GST portion of the fusion proteins to the matrix via 
the specific binding partner (glutathione) . In one 

10 embodiment, the wash buffer comprises a buffer such 
as Tris, salt (e.g., NaCl) , a chelator (e.g., EDTA) , 
and a non-ionic detergent such as nonidet P-40. 

As discussed above, FEK-fusion proteins can be 
captured on an anti-FLAG antibody affinity matrix. 

15 When the FLAG peptide is internal (e.g., not 

immediately at the N-terminus, as in pAR(aRI) 59/60) , 
anti-FLAG M2 antibody can be used as the specific 
binding partner. Suitable conditions and 
formulations (e.g., for wash buffer and elution 

20 buffer) for anti-FLAG M2 affinity chromatography can 
be obtained from International Biotechnologies, Inc. 

Once bound, the fusion proteins can then be 
washed in order to remove unbound material (e.g., 
contaminating proteins other than the fusion 

25 

protein) . The wash buffers described above are 
suitable for this purpose. 

The affinity matrix with bound fusion protein 
attached is then equilibrated (contacted) with a 
reaction buffer suitable for the modification 
30 reaction. In a phosphorylation reaction, for 
example, a kinase buffer such as HMK buffer is 
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Tell ' ^ PartiCUl « "action buffer „ iU 

buffers £ " the . ki " aSe ^"able reaction 

; V " lety ° f ="aracteriz«o kinases are 

05 lZ T h reac " on buffer - suitaw « *« 

«th the cAMP-dependent protein kinase fro. bovine 
heart muscle (HMK) , comprises (1) . buffering agent 
such as Tris, (2) a reducing agent such as 
dlthiothreitol (dtt). ,3, a salt such as Nad. and 

<° on MIT" V r 9nCSiU ™ ° r ° thSr =PP-Pri«e 

senarat ^ ? V AUh ° U9h ^ ° TT e « * •««> 

separately. lt is present in the kinase reaction 

buffer. m addition, a modification enzyme and 

Phosoh C r" d0n0r added - ^ °" e P«"=«« 
Phosphorylation reaction, a protein kinase prepara- 
tion coding the HMK enzyme is added, and 
abe ,d )ATP . Typicauy , the prMein J 

«e el!" 8 SUi " blS ^ior to addition. lB 

the examples, a variety of fusion proteins were 

«• ;eac t C L e „ n " y f r di ° labeled ^ using „„k 

C P !r ' ! e catalytic subunit ° f tha ™* 

n P]ATP as a modification donor. 

of thrH C ° nditi ° nS a PPr=Priate for phosphorylation 
of the bound fusion protein to produce a bound 

» with th <e ■ 9 " radi0labe1 ^' fusion protein win vary 
with the protein kinase selected. TypicaUy. the 
steps subsequent to culturin, the host cell are 
carried out at « . c . However, phosphorylation 
reactzons with the catalytic subunit of the 

muscle, have previously been carried out at 37 -c 

in the present method, phosphorylation can be carried 
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out at 37 °C. However, in another embodiment of -the 
present invention, efficient phosphorylation is 
carried out at 4 °C. This temperature can preserve 
the activity of fusion proteins, and of the selected 

05 polypeptide portion in particular. 

Optionally, the reaction can be quenched by 
addition of a stop buffer comprising a component 
capable of inhibiting the modification reaction. The 
component capable of inhibiting the reaction will 

10 vary with the modification enzyme, but can include 
inhibitors (e.g., reaction products, competitors), 
chelators, or other agents which stop the reaction. 
It can be desirable to stop a reaction when, for 
example, in subsequent steps, undesired modifications 

15 can occur. However, wash steps can serve to remove 
modification enzymes . 

For example, in a phosphorylation reaction with 
cAMP-dependent protein kinase, an HMK stop buffer can 
be added, comprising a buffer such as sodium phos- 

20 phate, a reaction inhibitor such as sodium pyrophos- 
phate, and a chelator such as EDTA . HMK stop buffer 
can optionally contain a carrier such as bovine serum 
albumin or glycerol. As a reaction product of a 
phosphorylation reaction, sodium pyrophosphate 

15 inhibits the kinase reaction. In addition, chelating 
2 + 

Mg ions inhibits the reaction. 

An advantage of the present method is that 
unincorporated label is easily removed by washing the 
affinity matrix with bound labeled fusion protein. 
10 The wash buffers described above, which permit 

binding of the affinity ligand to the affinity matrix 



SUBSTITUTE SHEET 



WO 93/03157 



PCT/US92/06187 



-32- 



via the specific binding partner can be used in this 
step, other methods for labeling proteins with the 
HMK enzyme have relied upon extensive dialysis to 
remove unincorporated label (e.g., Zhao, X.-X. et 

Analvt. BiochPm. 178 : 342-347 (1989)). The 
removal of unincorporated label by washing is 
advantageous because extensive dialysis can 
compromise the function of some proteins and results 
in the production of large quantities of contaminated 
dialysis buffer. 

If desired, the modified fusion protein may be 
isolated for use by releasing the fusion protein from 
the affinity matrix by washing with a suitable 
elution buffer. Elution buffers were discussed 
above. For example, a labeled GST-fusion protein 
such as a GTK-fusion protein can be eluted from 
immobilized glutathione with an elution buffer 
comprising glutathione (e.g., reduced glutathione), 
in one embodiment, an elution buffer comprising 
reduced glutathione, a buffer such as Tris, and a 
salt such as NaCl is used to release labeled 
GST-fusion .proteins from the affinity matrix. 

In the case where a cleavable linker is present, 
after elution, the fusion protein can be cleaved in 
vitro to free the modified (e.g., labeled) 
polypeptide portion. Cleavage is accomplished by 
contacting the fusion protein with an appropriate 
specific protease under conditions which permit the 
cleavage reaction. Such conditions for cleavage by 
enterokinase, thrombin and factor Xa, for example 
are known. The affinity ligand portion or any 
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uncleaved product can be removed by adsorption on the 
appropriate affinity matrix in the presence of wash 
buffer for example. The modified or labeled selected 
polypeptide portion can be recovered in the 
05 supernatant. 

Optionally, prior to elution from the affinity 
matrix, the modified polypeptide portion can be 
released from the affinity ligand portion by cleaving 
the modified fusion protein at the cleavable linker. 

10 An example protocol for cleavage by a specific 

protease on a column is provided in Abath, F. and A. 
Simpson, Biotechniaues 10: 178 (1991), the teachings 
of which are herein incorporated by reference. 

Smith (EP 0,293,249; PCT/AU88 / 00164 ; NZ 224,663) 

15 and Smith and Johnson ( Gene 67: 31-40 .1988)) also 
describe conditions for capture of GST-fusion 
proteins on immobilized glutathione, washes, elution 
and cleavage protocols. 

In one embodiment of the present invention, a 

20 kit for preparing a labeled fusion protein is 

provided, comprising (1) an affinity matrix to which 
a portion of the fusion protein can bind, (2) a wash 
buffer which permits binding of .the fusion protein to 
the affinity matrix, (3) a modification reaction 

25 buffer such as a protein kinase reaction buffer, and 
(4) a modification enzyme preparation such as a 
protein kinase preparation. Optionally, the kit can 
contain a reaction stop buffer or an elution buffer 
comprising a release component. The nature of these 

30 elements has been explained above in more detail. 

In another embodiment, a kit can comprise a 
vector of the present invention (e.g., pGEX-2TK or 
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Z 7 6 '' The vector can be • 

(lTth«r h e ', the COTPri " S eaCh <* th. elements 

°> eLel f r « J' "« deS « ibed — ■ « « 
.» h I reaCtl ° n st °P bu «er. m other 

Cct'at; : ™ 0difi ""°" — -«* as ,-laheled 
can also be included in the kit 

Prepa ra n uo„T Ph ° ryUti °" * P ™tein kinase 

" ln the prot C °" Prisi ^ • Proton kinase, 

in the Toll ot k ;™\^™^ «e kinase 

use. necessary prior to 

=r , a f „L;;:::Li^i:i: , : n ? - 
: p :::; d throu9h the m ^ — - il- 

tne labeling procedurp t„ ie 
i«™«K-i- y proceaure - In one embodiment, 

9e unit for capture of GST-fusion proteins. 
Uses of the Present Invention 

-pres^To/L? «- 

host cell These f a "° UntS ° f ' £USi ° n Pr " ein in * 
These fusion proteins Mn k« 

labeled «.,,•„„ ns can be conveniently 

uses L r meth ° dS and have many 

uses m research, diagnostic and therapeutic 
applications. peutic 
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For example, radiolabeled proteins are useful in 
imaging. In one embodiment, radiolabeled fusion 
protein or radiolabeled selected polypeptide portion 
can be used to locate cells carrying a ligand 
recognized by the selected polypeptide (e.g., a 
receptor, cell surface component, or antigen) . For 
example, a growth factor can be used as the selected 
polypeptide, and a labeled fusion protein of the 
present invention comprising the growth factor can be 
used to detect cells carrying a receptor for the 
growth factor. 

Similarly, incorporation of a suitable 
radiolabel permits the use of fusion proteins of the 
present invention in targeted radiotherapy. in other 
words, the selected polypeptide (e.g., a hormone, 
growth factor, antibody) present in the fusion 
protein can seek out specific cells and damage or 
destroy them due to the attached radiolabel. An 
isotope with the desired energy and half-life can be 
selected for this purpose. other labels incorporated 
by methods of the present invention and capable of 
killing cells can also be used. 

For example, a labeled antibody or portion 
thereof (e.g., Fy , Fab, etc.) produced by the method 
of the present invention could be used in this 
manner. Such fusion proteins would be useful for 
many other applications as well. The term antibody 
as used herein refers to antibodies such as chimeric 
antibodies, single chain antibodies, bifunctional 
antibodies and other antibody variants, including 
individual chains (e.g., a heavy chain). A variety 
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of methods for expressing immunoglobulins are 
available (see e.g., Skerra# A . and A Pluckthun; 
Science 240: 1038-1041 (1988)). 

in one embodiment, antibody chains are expressed 
as fusion proteins of the present invention in E 
£oll. A signal sequence (e.g., ompA, phoA) is f^sed 
to the N-terminus of an antibody chain, which is 
followed by at least one modification recognition 
sequence, an optional cleavable linker, and an 
affinity i igand . 0n expression in E. colj the 
signal sequence is cleaved from the fusion protein 
freeing the amino terminus of the antibody chain, and 
preserving the binding function of the variable 
region. Fusion genes encoding both chains can be 
expressed in this manner from two vectors or a single 
vector encoding two fusion genes. Alternatively 
either the heavy or light chain could be expressed as 
fusion protein of the present invention and the 
20 c C ~ ent3 ^ Chai " could be expressed in the same 
cell using typical antibody expression vectors 

In addition, as shown herein, labeled fusion 
proteins of the present invention can be used as 
probes. As shown in Example 4, labeled fusion 
proteins such as 32 p-rTv 

as nrnn , GTK-fusion proteins, can be used 

probes for screening expression libraries for 
Proteins capable of binding the selected polypeptide. 
For example, a fusion protein of the present 
invention comprising a selected polypeptide is 
expressed, p Urified by virtue Qf ^ >f 

and specifically labeled at the modification 
recognition sequence. The labeled, homogeneous 
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fusion protein is then incubated with filters on 
which proteins expressed by individual plagues of a 
target library have been immobilized . Following this 
hybridization step, the filters are washed, and 

05 processed for identification (i.e., by detection of 
the label) of plagues which produce recombinant 
proteins capable of specifically interacting with the 
labeled fusion protein probe. 

In particular, a cDNA encoding a portion (the E7 

10 binding domain) of the retinoblastoma susceptibility 
gene product (pRB) was prepared as a GTK-fusion 
protein and used to screen a library for proteins 
which interact with pRB. A number of positive clones 
were isolated. One of these clones, designated 

15 RBAP1, displays properties expected of a cellular 
protein capable of interacting with pRB. Labeled 
fusion proteins of the present invention comprising a 
selected polypeptide can also be used as probes in 
Western blot formats (Example A). 

20 In addition, as shown in Example 4, labeled 

fusion proteins of the present invention can be used 
to identify proteins present in complex mixtures 
(e.g., whole cell extracts), which are capable of 
interacting with the selected polypeptide. The 

25 

fusion proteins can be used in protocols to isolate 
the interacting proteins. For example, the 
interacting proteins can be captured by the selected 
polypeptide portion of a labeled fusion protein which 
is bound to an affinity matrix. Subsequent cleavage 
3° at a cleavable linker could release the complex of 
the interacting protein and a labeled selected 
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polypeptide portion for further analysis. 
Alternatively, a purified labeled fusion protein 
probe can also be used to determine the amount of or 
^ to detect an interacting protein or other substance 
in a complex mixture using immunoassay techniques, 
for example. 

In one embodiment, the binding of a specifically 
interacting protein (e.g., an antigen, hormone, or 
other substance to an immobilized (e.g., bound to an 
affinity matrix) fusion protein of the present 
invention can induce a change in the attributes of 
the bound fusion protein. other substances include 
but are not limited to agents such as a drug, toxin 
substrate or other ligands capable of interacting 
with the selected polypeptide portion of the fusion 
Protein. The difference between the unbound versus 
the bound state could be taken advantage of, as for 
example, in an assay format. For example, the 
2q presence of the interacting protein or other 

substance in a complex mixture (e.g., blood) could be' 
monitored. For example, binding of an interacting 
Protein could induce a conformational change that 
shields the labeling (e.g. phosphorylation) site. 
^ The complex mixture containing an interacting protein 
S C ° ntacted "it* the fusion protein, which is bound 
to the affinity matrix. The combination is washed 
equilibrated with a suitable reaction buffer and ' 
subjected to the labeling procedure. The extent of 
labeling is compared to a control which lacks the 
interacting polypeptide and the presence of the 
interacting protein in the complex mixture is 
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indicated by inhibition of labeling. By comparison 
with the degree of inhibition achieved by a standard, 
such a procedure can also provide information 
regarding the quantity of interacting protein which 

05 is present. 

The vectors and methods of the present invention 
can facilitate the characterization of cDNA clones. 
For example, an uncharacterized cDNA can be inserted 
into a vector of the present invention allowing rapid 

10 purification of the protein, without specific 

knowledge of its properties. The protein can be 
labeled with the maintenance of biological activity, 
and the labeled fusion protein can be used to detect 
potential interactions with other cellular proteins. 

15 The present invention will now be illustrated by 

the following Examples, which are not intended to be 
limiting in any way. 

Introduction to Examples 1-4 

Recently it has been demonstrated that the 

20 adenovirus E1A protein, SV40 and polyomaviral T 

antigens, and the human papilloma virus E7 protein 
can bind to the retinoblastoma susceptibility gene 
product, pRB. Neither these viruses nor these 
proteins are thought to be closely related to one 

25 another, yet each of these proteins shares a short, 
homologous, colinear, transforming sequence, typified 
by E1A conserved region 2 (CR2), which appears to be 
responsible for high affinity binding to pRB. The 
region of pRB which interacts with this motif has 
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been capped. To date, all spontaneously occurring 
loss of function, pRB stations which do not grossly 
concise pRB stability, map to this region. This 
led to the hypothesis that pRB , in the course of 
regulating cell growth, must interact with one or 
more cellular proteins bearing a sequence 
structurally resembling the viral pRB-binding motif 
If such a model were correct, one would predict that 
PRB ^activation might be achieved either as a result 
of competing complex formation with one of the above 
vxral oncoproteins, or as a result of RB mutations 
affecting the T/ElA/E7-binding domain. 
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Example 1 

Construction of pG£X-2TK 

Two synthetic oligonucleotides were designed, 
which, on annealing, encode a cAMP-dependent protein 
°5 kinase recognition site and three restriction sites. 
Codon utilization for the synthetic duplex was based 
on the prokaryotic codon utilization data of 
(Grantham, R. et al . , Nucleic Acids. Res . 9: r43-r74 
(1981) ) . 

10 The oligonucleotides were synthesized using 

standard techniques and were annealed in vitro (Gait, 
M.C.J. , (1984) Oligonucleotide Synthesis:. A 
Practical Approach , (I.R.L. Press; Oxford)). The 
resulting duplex has 5' -overhangs compatible with 

15 BamHI- and EcoRI-cut DNA and is shown below: 

ArgArgAlaSerVal 
5 ' -GATCTCGTCGTGCATCTGTTGGATCCCCGGG 

AGCAGCACGTAGACAACCTAGGGGCCCTTAA-5 ' 

Plasmid pGEX-2T (Pharmacia) was linearized with BamHI 
and EcoRI;' and the vector fragment was ligated to the 
synthetic duplex to make plasmid pGEX-2TK. 
Incorporation of the duplex into pGEX-2T was 
confirmed by restriction and DNA sequence analysis. 
DNA sequencing was performed using a Sequenase 2.0 
kit (United States Biochemical Corp.) with the 
protocol provided by the manufacturer. The. structure 
of pGEX-2TK in the region surrounding the kinase site 
is shown in Figure l. 
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As can be seen in Figure l, the incorporation of 
the synthetic duplex DNA into pGEX-2T resulted in the 
insertion of codons for a cAMP-dependent protein 
kinase recognition sequence, Arg-Arg-Ala-Ser-Val 
immediately downstream of the thrombin recognition 
site present in parent plamid pGEX-2T. In addition 
the synthetic duplex encoded a multiple cloning site 
(MCS) downstream of the kinase recognition sequence 
such that the insertion of the duplex led to the 
regeneration of the MCS of the parent plasmid, with 
restriction sites for BamHI , smal. and EcoRI . As in 
^he parent plasmid, the MCS is followed by a sequence 
with stop codons in all three reading frames. 

Example 2 

Generation of 32 P - GS T Fu S i on Proteins 

Construction of pG EX-2TK Vect ors Encoding pn Proteins 

RB cDNAs encoding residues 379-792 or residues 
379-928 of the retinoblastoma susceptibility gene 
product, both of which span the T/ElA-binding region 
were subcloned into P GEX-2TK. In addition, an RB ' 
CDNA encoding residues 379-928, having a naturally 
occurring, loss of function, RB point mutation 
(379-928;706F), which is known to abrogate T/E1A 
binding, was subcloned into pGEX-2TK.. These pGEX-2TK 
recombinants are named pGTK-RB(379-792) 
pGTK-RB(379-928) and pGTK-RB ( 379-928 ; 706F) ■ 
respectively. The encoded fusion proteins 'are named 
GTK-RB(379-792), GTK-RB ( 3 79-928 ) and 
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GTK-RB(379-928;706F) , respectively. These RB fusion 
proteins are also referred to as GST-RB fusion 
proteins or RB fusion proreins. 

Two of the RB cDNAs had been generated in an 

05 earlier study by Kaelin et al . (Kaelin, W.G. et al . , 
Mol. Cell. Biol . 10(7) : 3761-3769 (1990)), 
incorporated herein by reference, using the 
polymerase chain reaction. The RB cDNA inserted into 
PGEX-2TK to make pGTK-RB ( 379-928 ) corresponds to RB 

10 deletion RB dl 1-378 (Kaelin, W.G. et al . , Mol. Cell. 
Biol. 10(7) : 3761-3769 (1990)). The RB cDNA inserted 
into pGEX-2TK to make pGTK-RB ( 379-792 ) corresponds to 
RB deletion RB dl 1-378 ; 793-928 . Residue 379 is a 
methionine residue. 

15 Each amplimer used to generate the cDNAs 

contained a BamHI site, such that the resulting PCR 
product, upon digestion with BamHI, could be ligated 
in frame into the unique BamHI site in pGEX-2T 
(Pharmacia). The 3' amplimer contained a TGA stop 

20 codon as well. The PCR fragments encoding these two " 
cDNAs were each cleaved with BamHI and cloned into 
PGEX-2T, which had been linearized with BamHI, and 
treated with calf intestinal phosphatase. The 
resulting constructs were named pGT-RB ( 379-928 ) and 

25 pGT-RB(379-792) . The cDNAs were cleaved from the 
latter constructs using BamHI, and were each 
subcloned into pGEX-2TK, which had been cleaved with 
BamHI. The resulting plasmids are named 
pGTK-RB(379-928) and pGTK-RB ( 379-792 ) . Two 

30 additional amino acids were incorporated into the 
sequence at the BamHI site due to the structure of 
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the PC R priors used to clone both cDNAs , such that 
the ammo acid sequence at the junction of the 
Phosphorylation (kinase) site and the first RB 
residue (Met 37g ) is: 

(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met -, 

The mutant RB cDNA and referred to herein 'as 
either RB(379-928;706F) or RB (379-92 8 ;pm7 06) 

corresponds to the cDNA designated (379-928; P m 706, 
cons t by Kflelin ^ ^^^^ ^ ^ 

£211 il: 521-532 (1991),, also incorporated^^ by 
reference. The mutant cDKA fragment was generated by 
PCR of reverse-transcribed mRNA from cells containing 
the mutation. The PCR product was cleaved at the 

unique Ncol and BsmI sites in the rb no„« , u- w 
15 tho 7ncr ne RB ^ene (which span 

the 706F mutation,, and ligated into pGT-RB (379-928 , 
from which the wild type RB cDNA NcoI-BsmI segment ' 
had been excised. The resulting plasmid, 
pGT-RB(379-928;706r, ( encoded the 379-928;706F cDNA . 

^0 Zl T Cl6aVed With B3mHI t0 "1— • the 

cDNA, and the cDNA fragment was subcloned into 

PGEX-2TK which had been cleaved with BamHI to make 
PGTK-R B (379-928;706F). The amino acid sequence at 
the .unction of the phosphorylation (kinase) site and 
^ the first RB residue (Met^) is also 

(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met -) 

The products encoded by all three cDnIs have 

svlo r 5 ^" PrSViOUSly f ° r the - -bility to bind to 
SV 0 T antigen, the adenovirus E1A gene product, and 

putative cellular RB-binding proteins who. 
30 =<- nrrv y proteins, when expressed 

as PGEX-2T encoded GST fusion proteins. The fusion 
proteins encoded by pGT-RB (3 79-928) and 
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pGT-RB(379-792) were able to bind to T antigen, El A, 
and the putative cellular RB-binding proteins, while 
the mutant protein encoded by pGT-RB ( 379-928 ; 706F) 
did not (Kaelin, W.G. et al , f Cell 64: 521-532 
05 (1991)). 

Expression and Purification of GST-Fusion Proteins 

The expression of the pGEX-2TK-encoded protein 
and recombinant pGEX-2TK-encoded GST fusion proteins 

10 in E. coli (DH5q; Bethesda Research Laboratories), 
and the subsequent recovery of the proteins on 
glutathione sepharose was carried out essentially as 
described by Smith and Johnson (Smith, D.B. and 
Johnson, K.S., Gene 67: 31-40 (1988); Kaelin, W.G. et 

13 al. , Cell 64: 521-532 (1991)). Fresh overnight 
cultures of E. coli DH5a , transformed with either 
pGEX-2TK or pGEX-2TK recombinants, were diluted 1:10 
in Luria-Bertani (LB) medium containing ampicillin 
(100 vq/ml) and incubated for a total of 5 hours at 

20 37 °c, with shaking. After 1.0 hour of growth at 37 
°C, isopropyl-/3-D-thiogalactopyranoside (IPTG; 
Bethesda Research Laboratories) was added to a final 
concentration of 0.1 mM. 

For analysis of total bacterial protein content, 

25 aliquots of each bacterial culture were pelleted in a 
microcentrifuge, were boiled in urea-SDS cracking 
buffer (0.01 M sodium phosphate [pH 7.2], 1% 
£-mercaptoethanol , 1% SDS , and 8M urea), and were 
loaded onto an SDS-polyacry lamide gel. Proteins were 

30 visualized by Coomassie blue staining. 

For phosphorylation of protein and/or protein 
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recovery using glutathione sepharose (Pnarinacia) 
bacteraal cultures were pelleted at 5000 x g for 5 
»"> « 4 -c, end resuspended 1 /10 the original 

vere then lysed on ice by » U d sonicaticn and 
suited to centrifugation at xo.000 x g for 5 min 

< c. The clarified bacterial sonicates 
contain^ the P CEX- 2T K-encoded protein or relevant 

< C w lt h glutathione sepharose (20-30 „1 /Bl 

(Glut'tL 1 SOniC " e ' • 9lUt « hi °« "pharose beads 

washed -h Ph3rOSe PharMC "» had b «" 

" Isee above" 'T "* 1:1 «» 

1" prior t SUP me " ted Uith °- 54 "° n - ,at 

were also ™ S ™ d steps 

were also carried out at 4«c. 

PhosDhorvI*i-ir,„ Q f Protein* 
20 The sepharose beads, with bound protein, were ' 

then washed three ti.es with NETN followed by one 

H.«. „ * „,« The supe „ atant 

» I ; o 3G d needlS SePh " OSe - "suspended 

unit, l ! .r 1 ™" ° f " HMK bUffer raining i 

" thS C " a1 ^" «*»it of cAMP-depenLt 
rot i (slgma che]niMi ^ ^ ^ ^P 

England w ^ <60 °° "'""'^ 10 N *« 

Engird Nuclear) . and i « DTT. Th e Kinase reaction 

was allowed to proceed for 10 mi „utes at < -c with 

Per.odic agitation of the sepharose to .aintain 
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suspension. The reaction was terminated by the 
addition of 1.0 ml of HMK stop buffer (10 mM Na 
Phosphate [pH 8.0], 10 mM Na Pyrophosphate, 10 n\M 
EDTA, 1 mg/ml bovine serum albumin) . Following a 
05 brief spin in a microcentrifuge, the supernatant was 
again removed using a 23 G needle and the sepharose 
was washed 5 times with NETN to remove any 
unincorporated label. Incorporation of label can be 
determined while the protein is attached to the bead. 

10 Elution With Reduced Glutathione 

For further studies, the fusion protein can be 
eluted from the beads. After the final wash the 
residual supernatant was aspirated using a 23 G 
needle and the labeled or unlabeled GST fusion 

15 protein was eluted by rocking the sepharose for 10-15 
minutes in 10-50 bead volumes of 20 mM reduced 
glutathione, 100 mM Tris [pH 8.0], 120 mM NaCl. 

32 

Effect of HMK Sequence on P-Incorporat ion In Vitro 
The ability of the Arg-Arg-Ala-Ser-Val sequence 

20 to convert the GST fusion proteins into , substrates 
for the catalytic subunit of cAMP dependent protein 
kinase was tested. GST fusion proteins encoded by 
the pGEX-2TK recombinants, as well as the 
corresponding pGEX-2T constructs, were overexpressed 

25 in bacteria and recovered on glutathione sepharose as 

described above. As described in more detail above, 

the purified fusion proteins, while still 

non-covalently bound to glutathione sepharose, were 
3 2 

incubated with P-7-ATP and the purified catalytic 
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at ° t CAMP - e P-^nt protein Kinase for 30 min 

The s h " 3 kinaSS St ° P bUffer -d.d 

v h It" 5 " With b ° Und Pr ° tei - W - -ens ively 
03 c an el 7°" ^'^ated transferred to 

tubes ' and - — 

The GST-rb fusion proteins were readily 
Phosphorylated in_^itro, provided that the Kinase 
recognition sequence was present r- 
J 0 effect nf • ' e wa s present. Figure 2 shows the 
effect of inserting an HMK sequence on 32 P 

fusion proteins. In particular, GST-RB (379-792 , and 
CST-RB ( 379-792; Pm7 o 6) f usion proteins ^ ^ 
encoded by pGEx . 2TK and ^ 

15 sequence (hatched bars) showed c < 

. , ' snowed significant 

incorporation of 32 p t „ 

1 p * In contrast, GST-rb f 379-705 > 

o:: a T T ^379 " 792;pn706, fusi - — ^ » - - e 

bars we " laC ^ d ^ ™ K t««a 

, 0 " Sre " ot "9nificantly radiolabeled. 

glutathione sepharose beads, and were subjected t 
Phosphorylation U si„ 9 the si « protocol I 

'47, IT* haV4ng 3 "» ^beled „ ith 

' Ul C P»)' "hile the GST-RB<379-92»> f„„- 

T:;:iT:r from p — - ^ • 

te was not comparably phosphorylated (1 x 10 4 CDm , 
Similarly a pst Dn,-,-,« ~ 1 A i0 cpm) . 

riy, a GST-RB (379-92 8 ) ;pm706 fusion protein 
expressed from. P GEX-2TK and having a kina*! ! 
labeled with 32 P f3 v in 6 9 3 kl " aSe Slte wa * 

P (3 x 10 cpm), while the 

Phosphorylated (1 x 10 4 cpn) w " " 0t "mparably 
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Example 3 

Structural Integrity of Fusion Proteins 

Expression of Fusion Proteins in E. coli 

As discussed above, the GST-RB fusion proteins 

°5 expressed from pGEX-2T-derived plasmids 

pGT-RB (379-928) and pGT-RB ( 379-792 ) were able to bind 
to T antigen, E1A, and the putative cellular 
RB-binding proteins, while the mutant fusion protein 
encoded by pGT-RB(379-928 ;706F) did not (Kaelin, W.G. 

10 et al . , Cell 64: 521-532 (1991)). This observation 
suggests that the binding function and structure of 
the RB proteins is not grossly disturbed when they 
are expressed as part of a fusion protein with GST. 
It was further determined that the insertion of 

15 the Arg-Arg-Ala-Ser-Val (RRASV) sequence did not 
grossly alter bacterial expression of GST-RB fusion 
proteins expressed from pGEX-2TK. For this 
determination, whole cell lysates of IPTG-induced 
cultures were prepared and total bacterial protein 

20 content was analyzed directly on SDS-polyacrylamide 
gels as described above. Protein from E. coli 
transformed with pGTK-RB ( 379-928 ) , pGTK-RB (379-792) , 
or pGTK-RB(379-928;pm706) were analyzed. The band 
intensity for each fusion protein was comparable to 

25 

that for the corresponding construct lacking the 
phosphorylation site. Each fusion protein had an 
apparent molecular weight consistent with the size of 
the pRB fragment encoded by the RB cDNA insert and 
the 26 kD GST polypeptide. 
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PHrification of Fusion Prrtrin.- 

prot e The SUbSeqUent P^^tion of gst-Rb fusion 
Proteus expressed from pGEX . 2TK was ^ 

the insertion of the Erasv sequence. Bacteria" 

°v ach ° r the w ere ::i 

sepharo s : \ to Athlon. 

seph arose chroraatography ^ descr . bed 

three":!: ^ ^ ^ P — «« -en wLeo 

10 e luted b h T ^ ^ b ° Un ' d P " tei " s were 

eluted by boiling in SDS e b 

glycerol, 62 mM Tris fpH 6 81) Th. . 

subieci-oH «. , b - 8 J)- The supernatant was 

surrjected to electrophoresis on » m« 

oe] an . F ls on a 10 < Polyacrylamide 

pu i by cooraassie biue stai ^ 

5 GTK 1 ^tensity of the bands for 

GTK-fusion proteins expressed in p 

sestet ms ~- :L 

lackino Z I corresponding constructs 

lacking the kinase site (i.e., P GT-R B (379-928 ) 
> PCT- RB (379-792), or PGT-RB (379-928 ;pin706^, - 
respectively) 

The following materials and methods were used in 
subsequent experiments. 

£ells__and culture Conditions 

»«-»« <»..„ L : 'en" • f^ an S^.^ 

«" line transferred by a fr.^t ""^ 

» <=ra*a m . r. L . It al j Gen v T""*"" 

- — ' ' y - G en. Virol . 35. 
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59-72 (1977)), were grown in Dulbecco's modified 
Eagle's medium (DMEM) with 10% fetal calf serum 
(Gibco) . AJcata cells were grown in RPMI or DMEM, 
with 10% fetal calf serum (Gibco) . C57B1/6 primary 
mouse embryo fibroblasts (MEF) lines expressing 
either wild-type (MEF*Tex) or mutant (MEF*K1) forms 
of T antigen were grown in Dulbecco's modified 
Eagle's medium (DMEM) with 10% fetal calf serum 
(Gibco) and G418 (150 ug/ml) (Ewen, M.E. et al . , Cell 
58: 257-267 (1989)). All cells were grown at 37»C in 
a humidified, 10% C0 2 -containing atmosphere. 
Radioisotopic labelling of cells and preparation of 
cell lysates was as described previously (Kaelin, 
W.G. et al . , Cell 64; 521-532 (1991)). 

15 Antibodies 

Tissue culture supernatants were the source of 
monoclonal antibodies PAb 419 and M73 (Harlow, E. et 
al., J. Virol. 39: 861-869 (1981); Harlow E. et al ~ 
J. Virol. 55: 533-546 (1985)). The use of these 
antibodies for immunoprecipitation and western 
blotting was described previously (Kaelin, W.G. et 
al., Cell £4: 521-532 (1991)), except that electro- 
phoretic transfer of proteins to nitrocellulose was 
performed without the addition of methanol to the 
25 transfer buffer. 

Binding of Fus ion Proteins to T Antigen and E1A 

It was also determined that the insertion of the 
RRASV sequence did not demonstrably alter the binding 
behavior of GST-RB fusion proteins, GTK-RB(379-928) , 
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GTK-RB (379-792 ) , and GTK-RB (379-928 ;706F) , expressed 
from PGEX-2TK. The pRB binding assays for T antigen 
and ElA binding were performed essentially as 
described by Kaelin et al . (Kaelin et_al. , cell 64: 
521-532 (1991)). 

Fusion proteins GTK-RB (379-928) and 
GTK-RB (379-792) (bound to glutathione-sepharose) 
retained the ability to bind to SV4 0 T antigen or 
adenovirus ElA in solution. m contrast, no 
significant binding of T antigen or ElA to the mutant 
RB fusion, GTK-RB(379-928;706F), or GST expressed 
from pGEX-2TK was observed under the same conditions. 

Integrity of Phosphoryl *ted Fusion Proteins 

To determine whether the kinase reaction led to 
a significant alteration in the RB T/ElA-binding 
region of the GTK-RB fusion proteins, the following 
experiment was performed. Whole cell lysates 
containing wild-type T antigen (MEF*Tex cells) , the 
RB-binding and transformation defective T antigen 
mutant Kl (MEF*K1 cells), or ElA (293 cells) were re- 
solved by SDS-polyacrylamide. gel electrophoresis and 
transferred to nitrocellulose filters (See protocols 
in Example 4). GST-RB fusion proteins were kinased 
i£Lvxtro and eluted from the glutathione sepharose in 
the presence of reduced glutathione as described 
above. The eluted labeled protein was incubated 
overnight with nitrocellulose strips cut from the 
filters. The filters were then washed and subjected 
to autoradiography. Hybridization conditions and 
washes were as described below in Example 4, 
Hybridization of Filters. 



iR.srrruTE SHEET 



WO 93/03157 



PCT/US92/06187 



-53- 

Similar to the unphosphorylated version, 
32 P-GTK-RB(379-792) bound to E1A and wild-type T from 
MEF*Tex cells, but not to the T mutant Kl from MEF*K1 
cells. The presence of equivalent amounts of T and 
05 Kl in this assay was confirmed by immunoblotting with 
a monoclonal antibody directed against T antigen. 

The binding of 32 P-GTK-RB ( 379-792 ) protein to 
E1A was inhibited by the presence of a synthetic 
peptide replica of the human papillomavirus E7 

10 RB-binding motif (wild-type E7 residues 16-32). In 
contrast, a point mutant derivative of this peptide 
with a glu to gin change at residue 26, which is 
known to abrogate in vitro RB binding, was inert as 
an inhibitor. In addition, 32 P-GTK-RB (379-928) , but 

15 not 32 P-GTK-RB(379-928) ;706F) , exhibited E1A binding 
in this assay. Thus, it appeared that the structural 
integrity of the T/ElA-binding regions in the GST-RB 
chimeras was preserved during the kinase procedure 
and subsequent elution. 

20 The experiments described in this example 

indicate that the expression, behavior during 
purification and structural integrity of different 
polypeptides inserted into pGEX-2TK is not grossly 
affected by insertion of a kinase sequence or by 

25 subsequent phosphorylation. 
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Example 4 




05 



Library Manigulations 

For primary screening, libraries were plated at 
PProx lmately 4Qf000 pfu/i5Q Plated at 

ndu n Qf ,. galactosidase fusion P s. 

s:on with IPTC impregnated nitrocellulose 

lt fS Perf ° rmed as ascribed by singn et al 

f^ni^l: 252 . 261 (1989)) ^Jj^' 

library manipulations including screening with 

P-labelled cDNA probes, purification of specific 
clones, preparation of recombinant p hage DNA and 
15 subcloning of cDNAs im-o <-h ' 

(Stratao J , sequencing vector pBKS 

(Stratagene) were performed using standard 
techniques. 



^famion^Qlti^ceUulose Filters f , , 
Hybndizat Ion 

P»ly cryla,* g els to nitrocellulose 
"ot analysis was carried out in 1M M olycine L 
* Tr ls( Base,. and 0.01% s DS . PIaque U JJ ^ 

Alters were placed, directly into IX HBB ( 25 m 
Hepes-KOH[pH7.„. 25 m Nacl _ 5- ^ 5 « 

KP-40 «t h out dryin, and incubated overni g nt. J h 
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and subsequent manipulations were performed at 4°C 
with gentle rocking. The filters were then denatured 
and renatured as described by Vinson et al . ( Genes 
and Dev . 2: 801-806 (1988)). Processing of multiple 
filters was done in batch. 

Following renaturation , the filters were placed 
in fresh IX HBB supplemented with 5% non-fat powdered 
milk and 0.05% NP-40, and incubated for 1 hour. The 
filters were then incubated in IX HBB supplemented 
with 1% non-fat powdered milk and 0.05% NP-40 for at 
least 30 minutes prior to hybridization. 

Hybridization of Filters 

To prepare the hybridization solution, protein 
expression in E. coli (DH5q) transformed with 
PGEX-2TK and pGEX-2TK-derived plasmids were induced 
with IPTG as described above in Example 2. The 
bacteria were then pelleted and resuspended in 1/10 
the original culture volume in Hyb 75 (20 mM Hepes 
[pH 7.7], 75 mM KC1, 0 . 1 mM EDTA, 2 . 5 mM MgCl 2 , 1 mM" . 
DTT, 0.05% NP-40). The bacterial suspension was 
sonicated with a probe type sonicator (Branson) and 
centrifuged at 10,000 x G for 5 min at 4°C. For 
western blots the clarified supernatant was used 
undiluted or diluted 1:2 with Hyb 75. For screening 
plaque lifts the clarified supernatant was diluted 
1:2-1:8 with Hyb 75 supplemented with 1% nonfat 
powdered milk. 

The relevant 32P-labeled GST fusion protein was 
added at 100,000-250,000 cpm/ml. Hybridization was 
carried out at 4°C with gentle rocking overnight, 
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after which the filters were washed three times 
(10-15 min/wash) with Hyb 75 with 1% nonfat powdered 
■Hk. The filters were then dried, covered with 
saran wrap and exposed to film at - 70 o C with fin 
intensifying screen. 



05 



Screening PreclMred Lv sates for Sp ecifically 
Interact ing Protein.! 

Previously, a series of GST-RB fusion proteins 
jo «P«««1 from pGEX-JT vectors, end non-covalently ' 
bound to glutathione sepherose, were used to search 
for cellular proteins capable of interacting, 

do"L"yv 0r , indlre " ly ' ^ PRB ""^-binding 

(Kaelrn. W.G. et_al.. cell 64:521-532 (199l)) 9 

» i r; 7 cenuiar proteins - ere ide "" fiea -"«* 

T/ElA/E7- bl nding domain. These cellular proteins 
'.il- to bind to GST-RB fusion proteins (with no „„k 
stance) derived fro, spontaneously occurring, loss 
^ of function RB nutations. m addition, the binding " 

ll Zl C :T" Pr ° teinS " GST " RB fusi °" P^teins 
the In , y " Synth " ic W". corresponding to 
the PRB-brnding/transformin, seguence found in SV40 T 
antrgen. In contrast, a point mutant derivative of 

25 the"^^' "T"" 0 ""** *° th ° "*>«*• <ound in 

Inhih t ! " defeCtive T *«ant Kl, did not 

inhibit binding. 

It appeered that one or more of these proteins 
-y fuifill the criteria for a meaningful pRB 
cellular ligand. Unfortunately, which of the 
cellular proteins bound directly to the pRB T/E1A/E7- 
blnd.ng domain could not be determined from these 
experiments. 
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To demonstrate which of these cellular proteins 
is capable of interacting directly with RB, an 
35 S-labeled WERI-Rb27 retinoblastoma cell lysate was 
prepared. The lysate was precleared by passage over 

05 glutathione sepharose which had been loaded with 

pGEX-2T-encoded GST- to remove proteins which may bind 
to GST rather than the protein of interest. GT- 
RB(379-792) fusion protein was loaded onto the 
glutathione sepharose beads, and the precleared 

10 lysate was incubated with the bound GT-RB ( 379-792 ) 
fusion protein in the presence (for strip 1) or 
absence (for strips 2-4) of wild-type E7 peptide 
(residues 16-32) . Unbound protein was removed by 
washing and the proteins were eluted by boiling in 

15 sample buffer. 

Bound proteins (directly and indirectly bound to 
the RB-bound beads) were loaded in wide wells, 
resolved by SDS-polyacry lamide gel electrophoresis, 
and transferred to nitrocellulose. The filter was 

20 then cut into adjacent strips and either subjected to 
autoradiography (strips 1 and 2) or probed with 
32 P-GTK-RB(379-792) in the presence (strip 3) or 
absence (strip 4) of the above-mentioned E7 peptide. 
Strips 3 and 4 were then washed, dried, placed under 

^ saran wrap (which greatly reduces the S signal) , 
and placed under film. 

At least two bands appeared to be capable of 
interacting directly with the 32 P-GTK-RB ( 379-792 ) 
probe in a peptide inhibitable manner. These bands 

30 were not detected by 32 P-GTK-RB ( 379-792 ) on strips in 
which the lysate was incubated with GT-RB ( 379-792 ) in 
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the presence of wild-type E7 peptide prior to 
elect rophoresiS/ indicating specificity of binding. 
Furthermore, the ability of these two cellular 
proteins to interact with 3 V G TK-RB (37 9 - 792) in the 
f^ter binding assay was not inhibited by a point 
mutant (Glu 26 to Gln 26 ) version of the E7 peptide 
defective in RB binding. 

icreenino Whole-celj Lvsa^f^pecif^ca^ 
Interact ing Proteinc ■ 

! J alS ° Pr ° bed £ ° r Proteins 

ZZl ° f bindin9 directly to the -^-i- 

extract was subjected to SD S -polyacrylan,ide gel 

-tt;^\ i :;^r;ir ed to °« - 

«M=h were probed with seleoted "p-labeled fusion 
proteins. A g ain , there appeare<J t<> be ^ ^ ^ 

cellular proteins capable of interacting with 

reoio! K w RB fUSi ° n P L° tei " S in " Mch the T/ZlA-binding 
region was intact ("p-GTK-RBt 379-792) and 

P-GTK-RB(379-9 28) p m ,06., fusion proteins) ■ 

Furthermore, the binding of these cellular 

Proteins to wild-type 3 W-RB ( 379-792, fusion 

protern could be selectively inhibited by three 

RB re bi„d B " bindln5 PePUdeS ' PePt " e " P "=" °* 

acids ^T" 5 SeqU6nCeS f ° Und " T •""•«• <«ino 
acids 102-115, , E1A (tyrosine followed by E1A 

residues U5-132). or E7 ^ ^ 
n ibited binding of the labeled probe to the two 
cellular proteins. In contrast, point- mutant 
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derivatives of these peptides failed to inhibit the 
interaction of the probe with the filter-bound 
cellular proteins.- The particular mutant peptides 
tested included the T antigen peptide with a Glu 1Q7 
to Lys change, the E1A peptide with a Cys 125 to Gly 
change, and the E7 peptide with a Glu„ to Gin 
change. Similar results were obtained when this 
assay was performed with a human Burkitt's lymphoma 
cell line (Akata) as well as with normal peripheral 
blood lymphocytes. 

Use of Labeled GST-fusion Proteins as Probes for the 

Isolation of Genes 

In order to isolate cDNAs encoding cellular 

proteins capable of interacting specifically, and 

directly with RB, an Akata Agtll expression library 

was screened with the 32 P-GTK-RB(379-792) probe as 

described in the protocols above. six Agtll clones 

(clones 1, 3, 4, 5, 6 and 9) encoding 0-galactosidase 

fusion proteins which bound to P-GTK-RB ( 379-792 ) 

with high affinity were plaque purified and subjected 

to further analysis. 

The binding of the fusion proteins encoded by 5 

of these clones (clones 1, 4, 5, 6 and 9) to the 
3 2 

P-GTK-RB (379-792) probe was markedly reduced or 
undetectable in the presence of the wild-type E7 
peptide, while the mutant E7 peptide had no effect on 
binding. Furthermore, the fusion proteins encoded by 
all of the clones bound readily to 
32 P-GTK-RB(379-928) , whereas binding to 
30 32 P-GTK-RB(379-928;706F) , a protein with a mutation 
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in the T/E7/E1A binding region of RB was 
undetectable. Thus, binding to these clones 
displayed RB binding behavior similar to cellular 
Proteins which interact with RB. 

Cross-hybridization experiments, and subsequent 
sequence analysis, demonstrated that 4 of the clones 
(Clones i, 3 , 4 and 6) contained overlapping cDNA 
fragments derived from a common mRNA. The gene 
encoding this mRNA will be referred to as RBAPl 

Sequence analysis of clone 5 predicted that it 
encoded . ,. galactosidase leader polypept . de fused 

the sequence His-Ser-Phe-Leu-Leu-Cys-Asp- G l u - Asn _ Val _ 

Leu xT P ' ThUS ' fUSi ° n Pr ° tein Cont -- the 

Leu x-eys-x-Clu motif common to the viral RB-binding 

-otxfs. Additional cDNA clones related to clone 5 

were obtained by screening a 293 cell library 

Till 5 inSSrt CDNA - SSqUenCe anal ^ sis «* these 

c ones suggested that the short open reading frame in 

clone 5 was generated by the juxtaposition of a 
normally untranslated cDNA segment with the Agtll 
^-galactosidase coding sequence. Thus, while the 
clone may not represent a cellular protein, upon 
expression in Agtll, it encodes a protein with 
prop ertie ( .. g _ f a Leu _ x . Cys . x . Glu 

with viral RB-binding motifs. 

readinoT Cl ° n6 *' * lon * 

reading frame, consistent with the possibility that 

it encodes an RB-interacting protein. 
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Additional Characterization of RBAPl 

The retinoblastoma gene product is believed to 
serve as a cell cycle regulatory element. In 
particular, pRB is thought to contribute to the 
05 regulation of cell cycle progression as cells 
traverse the Gl/S boundary. Consistent with a 
possible role as an RB-interacting protein, RBAPl 
message levels appear to respond to cell, cycle 
events . 

10 A Northern blot was prepared of total RNA 

obtained from resting peripheral blood lymphocytes 
(PBL) and from PBL at various time points following 
stimulation with a cocktail containing PMA, PHA, and 
a calcium ionophore. The Northern blot was probed 

15 with an RBAPl probe from clone 4. A single message 
of about 3.5 kb became detectable 24-36 hours after 
stimulation. 

In a similar experiment, PBL were stimulated to 
enter the cell division cycle in the presence of 

20 hydroxyurea (HU) . Again, a single 3.5 kb mRNA was 
detected with the RBAPl probe. The abundance of the 
message in RNA from HU-treated PBLs increased to a 
maximum at 3 6 hours, and then appeared to plateau. 
Furthermore, the level of the mRNA in cells detected 

25 by the RBAPl probe fell dramatically within 8 hours 
after removal of HU. 

In vitro Binding of RBAPl to pRB 

The RBAPl cDNA was cloned into pGEX-2T and 
expressed in £. coli as a GST-fusion protein. 
30 Glutathione sepharose beads were loaded with the 
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preo I 1 fUSi0n ' S -^< » Proteins were 
rTl; d A b r i in Vitr ° - transition 

Ubeled RB proteins, RB( 3 7 9 - 928 ) and 

oosni; t ™ ° t v usion protei - 

10 "p-ctk » B ,„c ln Vitro translated 

P =TK-RB,37 9 - 928) protein, but not 

P-OTK-RB„ 79 - 928;p „ M6) fus . on prote . n 

^r: r d od ? s experiment further «-* t h e 

the RB qene " int «»«in g ««ct ly with 

Lne KB gene product. 

« a L B " rel " ed Pr0tei " P1 ° 7 

Pic which is°n pr r ein - a =ona enc ° din9 3 «* 

Lain t r; n ogous r the t/eia/e? 

exal I ' Pr ° t ° e ° 1 ' deS " ib ^ *» «» aoove 

a"i P1 ° 7 fUSi ° n Pr ° tain W '- in 

Prefe^'in «» ^ Protein 

Present „ the lysate was captured on a 

usino th ~ ~ " 3 Ph °^-'^tion reaction 

rjAIP ' The [ P]-labeled GTK-m m 
fusion protein was eluted fro™ ^ , P 

i0 -d as prooe in -^Tr^T'.^ 
exacts. Whoie ceU extracts (ml ^ M) contalnino 
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E1A were prepared from 293 cells, subjected to 
SDS-polyacrylamide gel electrophoresis, and electro- 
phoretically transferred to filters. Filters were 
cut into strips and probed with (1) anti-ElA 
05 antibody, (2) [ 32 P] -labeled GTK-pl07 fusion protein, 
(3) [ 32 P]-labeled GTK-pl07 fusion protein in the 
presence of competing wild-type E7 peptide (residues 
16-32), and (4) [ 32 P] -labeled GTK-pl07 fusion protein 
in the presence of mutant E7 peptide (residues 16-32, 

!0 with a Glu to Gin mutation at residue 26) . 

' 32 
Autoradiography indicated that the [ P] -labeled 

GTK-pl07 fusion protein, alone or in the presence of 

mutant E7 peptide, was able to detect the E1A product 

on the blots. The specificity of interaction was 

15 indicated by the observation that the [ P] -labeled 

GTK-pl07 fusion protein did not detect the E1A 

product in the presence of competing wild-type E7 

peptide . 

Example 5 

20 Construction of pAR (&RI ) 59/60 and Properties of 

pFEK-fusion Proteins 

Construction of pAR (aRI ) 59/60 

Plasmid pAR3040 (also referred to as pET3a) was 
the starting material (Studier, Meth. Enzymol . 185 : 
25 60-89 (1990)). This plasmid and expression plasmids 
derived from this vector can be maintained and 
induced for expression as described by Studier 
(Studier, Meth. Enzymol .185 : 60-89 (1990)). Plasmid 
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PAR3 04 0 was cut with EcoRl . The overhangs were made 
blunt by -filling in" using Klenow enzyme and dNTPs 
and religated to destroy the EcoRl site. In an 
infrequent event, the EcoRl site was regenerated on 
propagation of the plasmid. Therefore, another 
version of of this intermediate was constructed by 
treating with mung bean nuclease to remove the EcoRl 
overhangs prior to religation. Regeneration of the 
EcoRl site in the latter version was not observed. 
Both versions of the intermediate construction were 
modified as described below, to make two slightly 
different pAR(.Ri) 59-6O vectors (i.e., differing in 
the manner in which the EcoRl site was destroyed) 
These vectors behave identically with respect to 
expression of encoded fusion proteins. The 
intermediate vectors were then cleaved with Ndel and 
treated with calf intestinal alkaline phosphatase 
(CIP) . 
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Two complementary oligonucleotide adaptors, MAB 
59 and MAB 60, were synthesized, kinased, and 
annealed. The adaptors were then ligated into the 
Ndel cleaved, CIP-treated vectors. The adaptors 
°5 introduce the 17 amino acid sequence comprising the 
FLAG peptide and HMK site shown in Figure 2. The 
sequence of the adaptors is shown below: 

Met Asp Tyr Lys Asp Asp Asp Asp Lys Ala Arg Arg- 

5'-T ATG GAC TAC AAA GAC GAT GAC GAT AAA GCA AGA AGA- 
10 AC CTG ATG TTT CTG cTA CTG CTA TTT CGT TCT TCT- 

Ala Ser Val Gin Phe- 

GCA TCT GTG GAA TTC CA 

CGT AGA CAC CTT AAG GT AT- 5' 

Plasmids having one insert of the double-stranded 
15 adaptor were identified by restriction analysis of 
miniprep DNA. An Xbal-BamHI double digest releases a 
fragment comprising the inserted sequence. The 
structure of resulting plasmids, named pAR(ARI) 59/60, 
was confirmed by dideoxy sequencing using 

20 

oligonucleotide primers MAB 58 
(5'-GCAGCCAACTCAGCTTC-3 7 ) and MAB 69 
( 5 ' -TTAATACGACTCACTAT-3 ' ) . 

Construction of Derivatives of pAR(aRI) 59/60 

Four derivatives of pAR (ARI ) 59/60 were prepared. 
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The farst derivative encoding a selected polypeptide 
was prepared by the insertion of a 1.5 kb EcoRl 
fragment of a shPan-l (N3) cDNA into pAR(ARi) 59/60, 
os which had been cleaved with EcoRl and treated with 
CIP. The shPan-l (N3) cDNA encodes a DNA binding 
protein, as described by German et al . (German et 
— •' M olec. En docrinol . 5: 292-299 (1991)). 

Plasmids containing the insert were detected by 
restriction analysis of miniprep DNA using a 
BamHI-stuI double digest. The correct orientation 
yielded a diagnostic 500 bp fragment. The sequence 
at the junctions between the vector and insert was 
verif led by sequencing, using oligonucleotide priors 
^ MAB 58 and MAB 69 (see above). The plasmid is 
referred to herein as pFEK-shPan-l . 

The second derivative of pAR(^Ri) 59/6O was made 
by the insertion of N3-SH, a fragment of shPan-l 
generated by PGR (German et_al. , Molec. Eng^crinoj 
q 5: 292-299 (1991)). For cloning , the fragment was 
generated by PGR, using oligonucleotide primers MAB " 
72 (5-GGCCGAATTCTCCTGGTCCCACGGAGACCC-3') and MAB 73 
(5 -GGCCGAATTCGCCGAGGAGGACAAGAAGGACC-3 ' ) The PGR 
products were purified on an agarose gel 
5 electroeluted, treated with T< DNA polymerase, and 
digested with EcoRl. The resulting fragment was 
ligated to EcoRl-cut, CIP-treated pAR (ARI) 59/60 . 
constructs with single inserts were identified by 
Xbal-BamHI double digests of nuniprep DNA. The 
junctions and sequence of the insert were verified by 
dideoxy sequencing using oligonucleotides MAB 58 and 
MAB 69 (see above) . The resulting construct is 
referred to herein as P FEK-N3-SH. 
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The third derivative of pAR ( aRI ) 59/60 was 
constructed by the insertion of a DNA sequence 
encoding amino acids 120 through 206 of the rat c-fos 
protein (Kouzarides, T. and E. Ziff , Nature 336 : 
05 646-651 (1988); Curran, T. et al M Oncogene 2: 79-84 

(1987) ; Nakabeppu, Y . and D. Nathans, EM BO J . 8: 
3833-3841 (1989)). The specific fragment used for 
cloning was generated by the polymerase chain 
reaction (PGR) , using oligonucletide primers MAB 70 

10 ( 5 ' -GGCCGAATTCGCGCAGAGCATCGGCAGAAG-3 ' ) and MAB 71 
( 5 ' -GGCCGAATTCCTACTAGATCTTGCAGGCAGGTCGGT-3 ' ) . The 
resulting PCR products were purified on an agarose 
gel, electroeluted , treated with T4 DNA polymerase, 
and digested with EcoRI . The resulting fragment was 

15 ligated into EcoRI-cut, CIP-treated pAR (aRI ) 59 / 60 . 
Isolates with single inserts were identified by Xbal 
and BamHI double digests. The junctions and insert 
sequence were verified by dideoxy sequencing using 
the oligonucleotide primers MAB 58 and MAB69 (see 

20 

above) . The resulting plasmid is referred to herein"' 
as pFEK-c-fos (120-206) . 

The fourth derivative of pAR ( aRI ) 59/60 was 
derived by the insertion of amino acids encoding 
residues 206 through 34 0 of the human c-jun- protein 
25 (Kouzarides, T. and E . Ziff, Nature 336 : 646-651 

(1988) ; Bohmann, D. et al . , Science 238 : 1386-1392 
(1987)). The fragment for cloning was generated by 
PCR using oligonucleotide primers MAB 74 
(5 , -GGCCGAATTCTTTCCCGCGCAACCCCAGCA-3' ) and MAB 7 5 

30 (5 ' -GGCCGAATTCCCGACGGTCTCTCTTCAAA-3 ' ) . The PCR 
products were purified on an agarose gel, 
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electrocuted, treated with T4 DNA polymerase, and 
digested with EcoRI. The resulting fragment was 
ligated to EcoRI-cut, CIP-treated pARURI) 59/60. 

Constructs having a single insert were again 
identified by Xbai-BamHI digestion of mini-prep DNA. 
The junctions and insert sequences were confirmed by 
dideoxy seuqencing using primers MAB 58 and MAB 69 
(see above) . The resulting plasmid is referred to 
herein as pFEK-c-jun (206-340) . 

pFEK-fusion Pro tein s R e t ain Biological Activity ,nH 
Are Effi ciently Phosphorvlated 

As each of the selected polypeptides expressed 
from pARURI)59/60 is a DNA binding protein, the 
activity of the selected polypeptide portions, in the 
context of a fusion protein comprising an affinity 
ligand and modification site, were assayed by 
electrophoretic mobility shift assays (EMSA) . Assay 
conditions for the FEK-shPAN-1 fusion protein encoded 
by pFEK-shPan-1 and by pFEK-N3-SH, were as described 
by German et_jU. (German et al . , Molec." Endocrinol 
5: 292-299 (1991)). EMSA conditions for the fusion 
protein encoded by pFEK-c-fos ( 120-206) were as 
described (Kouzarides, T. and E. Ziff , Nature 336- 
646-651 (1988); Curran, T. et al .. Oncogene 2: 79-84 
(1987); Nakabeppu, Y . and D. Nathans, EMBOJ. 8: 
3833-3841 (1989) ) . 

EMSA conditions for the fusion protein encoded 
by P FEK-c-jun(206-340) were as described (Kouzarides 
T- and E. Ziff, Nature 336: 646-651 (1988); Bohmann, ' 
D. et_al., Science 238: 1386-1392 (1987)). The DNA 
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binding activity of the FEK-c-jun (2 06-34 0) fusion 
protein present in bacterial extracts, was assayed in 
combination with bacterial extracts containing fos 
"core" (to assay binding of jun as a jun-fos 

05 heterodimer) . . 

The four unlabeled fusion proteins were made by 
in vitro transcription/translation. The particular 
DNA probes used in each shift assay were radiolabeled 
for detection of DNA-fusion protein complexes. As a 

*0 positive control, each of the native (i.e., not 

fused) DNA binding proteins was prepared by in vitro 
transcription/translation and subjected to the shift 
assay. The short polypeptide produced by 
pAR(ARI) 59/60, and the transcription/translation 

15 products from a vector control for each 

pFEK-construct in which the selected polypeptide 
insert was in a reverse orientation, provided 
negative controls for DNA binding. As indicated by 
the shift assay, all four pFEK-fusion proteins bound 

20 DNA to an extent comparable to the corresponding 
native protein. These results indicate that the 
incorporation of the selected polypeptides (fos, jun, 
shPan-1 (N3), and N3-SH) into fusion proteins of the 
present invention comprising a modification 

Z5 recognition site, did not alter the biological 
activity (DNA binding) of these proteins. 

In parallel to the experiment just described, in 
vitro transcribed and translated fusion proteins were 
modified by phosphorylation with HMK kinase and 

30 non-radioactive ATP (see below for conditions) . 
Modification by phosphorylation could be 



SUBSTITUTE SHEET 



WO 93/03157 

PCI7US92/06187 



-70- 



distinguished because the negative charge of the 
actional phosphate group on the modified fusion 

05 I """ Pr ° tei " CO " PleX " »ith the 

mo fled protein . DNA c<apl4x _ Modi£lcati 

e t r , the fusion proteins did "~ th. 

extent of complex formation, indicating that the 
biological activity of the 

* selected polypeptide 

Portion as not altered by modification. 



1 ^^^Si £ i^ S r £sl i 2p In BL 2 1 

and '" dUCti0 " ° f eXpreSSl °" *«» pARURI, 59/60 

by Stud PUS "" S d ° ne •»«"««y « ascribed 

(1550,,. Briefly, plasaids were transf ormeTTnto 
competent B L21 or BL 21 p Lys£ bacteria. A sin," 
colony was inoculated into Luria broth with 

ZT'i" " 50 U9/ ^ and " ith P»eniooi at 25 

2'lt ^ " "* C aU °- a «" Proceed until 

20 " t aLT denSity <W ° f •W^i-t.ly i.o was ' 
tta ed. IPTG „ as addad ^ described 

Mfih^Hoi. HS: 60-89 ,„,„„. Cells „ ere 
or another o hours or so at ,,-c. A bacterial 
lysate was prepared using standard teohnigues. 
^ Following expression in E. CBU , the 

unfractionated bacterial e »t r „, , 
carrvi „„ . 1 ext "=t from transformants 

carrying expression vector pFEK-shPan-1 , pFEK-„3-SH 
PFEK-c-fosa.0-.06,, or pFEK-c- j un (2 06-; 4 :, was 
assayed by eleotrophoretio mobility shift assay as in 
the previous section, The results indicated tnat the 
unmodified or modifier! ,u w 

modified (by phosphorylation with 
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unlabeled ATP) FEK-fusion proteins expressed in 
bacterial extracts retained the biological activity 
of the selected polypeptide portion. 

Partial Purification of Proteins Produced : 

Following expression in E. coli , bacterial 
extracts from cells transformed with a pFEK- vector 
were fractionated to obtain partially purified 
FEK-fusion proteins by one of two methods. 

DEAE-Sephacel chromatography : 

-used as per directions of supplier 

(Pharmacia-PL) 
-buffer contained 100 mM NaCl and 10% glycerol 
(in addition to the buffer constituents 
presents in the bacterial lysates) 
-treated in "batch 11 for 60 minutes at +4°C 
-lysate was incubated with resin with 
continuous gentle agitation 
-supernatant solution passed over a 2 ml 

column of resin 
-flow-through collected 

The fusion proteins encoded by vectors 
pFEK-shPan-1 and pFEK-N3-SH were obtained using 
DEAE-Sephacel chromatography at -50% purity. 

Heparin-Sepharose chromatography: 
-used as per directions of supplier 
(Pharmacia-PL) 



A) 
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-DEAE flow-through fraction from protocol (A) 
was loaded onto a 1 ml bed volume column of 
resin 

-resin washed with 10 column volumes of load 

buffer (i.e., ioo mM salt) 
-proteins were eluted with l column volume each 
of 200, 300, 400, 500, 600, 700, 800, 900 and 
1000 mM salt 
-fractions were tested for the presence of 
active protein by the appropriate assay 

For example, in the case of a fusion protein 
comprising the fos 'core' (encoded by 
pFEK-c-f os (120-206)), activity of fractions was 
assayed by electrophoretic mobility shift assay 
together with reticulocyte-produced c-jun protein. 
Peak activity for the fusion protein comprising the 
fos 'core' fusion protein was eluted at approximately 
500-600 mM salt yielding a preparation with -50% 
purity. 

Phospho rylation of Proteins with HMK : 

All four fusion proteins produced by the pFEK- 
vectors (pFEK-shPan-1 , pFEK-N3-SH, 

PFEK-c-f os (120-206), and pFEK-c-jun (206-340) ) were 
capable of being modified by phosphorylation when 
present in crude bacterial extracts. As discussed 
above, modification by phosphorylation with unlabeled 
ATP was observed. m addition, modification using 
radioactively-labeled ATP was observed. 
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Proteins which were partially purified by the 
methods (A and B) described above were also 
efficiently radiolabeled by phosphorylation using HMK 
enzyme and [nr -" P]AT p. Labeling tQ a 

0, activity of approximately io 7 to io 8 com per „ g of 
protein was observed for partially purified - 
FEK-fusion proteins encoded by pFEK-shPan-i 
PFEK-N3-SH, and pFEK-c-fos (120-206). The protocols 
used for protein labeling are described below. 

10 £&S5Bhgrylation with Heart Muscle Kin„ co . 

fro m # P " 2645 KinaS6 ' Cata ^tic subunit 

fro, bovxne heart was obtained as lyophilized powder 
in 250 unxt vials. Typically, 250 units of HMK 
enzyme was resuspended in 25 ,1 of 40 mM DTT (i . 
» atiou/.l). The solution was allowed to stand 'at' 
room temperative for 10 minutes and was stored at + 4 
•C Activity was stable for 2-3 days, but HMK was 
usually freshly reconstituted for each use. 

20 „„„ A Preparation of HMK Buffer was prepared 
20 200 Tris-Cl (P H7.5), lOmMDTT, 1MNaC1< and 

120 mM M gci 2 ) . The phosphorylation reaction fixture 
was as follows: 

3 nl (lox) HMK buffer 

■>5 ^ 32 [732P] (S " 9 " NEN " Du P°nt #NEG-035C 

It P 3 -ATP >7000Ci/mmol) 
1-10 ul of protein extract (amount will vary with the 

particular protein) 
1 Ml of io u/^1 HMK . 
water to a total of 30 „1 
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The reaction fixture was incubated at 37 c C for 30 _ 60 
ffllnUt6S ' and was st °"d on ice until use. Sorae 
Proteins were labeled almost as well when the 
o _ mcubation was carried out on ice. The low 

temperature incubation can serve to preserve the 

These procedures can be scaled up as required. 

10 blo^ FEK " fUSi ° n Pr ° teinS Were Western 
blot.mg or for plague screening. For these 

applications, the fusion protein was run a G50 eoll 

ll^T7T " ° raer " re "° Ve -incorporated 

label hitan, the hmk phosphorylation reaction. 

5 7 „ " KC1 " ^"^ « S »*P— KOH (CPH 

IrlLT T " 9 " 2 ' 20% 91ycero1 ' 100 " *«' «• 

LI ■ Solu "=- comprising 2 + kci, and BSA 

> I addSd t0 ' fi " al eventration 

01 1 ^ust prior to use. 

with 5 al of (2 + o.i m kci * i / , 
"*tj and resuspended at n-n in 

-the beads were r^. * "** bUffer 

w ere rotatea at pt for- i w 

thre<1 RT for - 1 hour ' and washed 

- 1 "1 sterile plastic pipette was packed with the 
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washed G50 (bed vol -1.2 ml) and equilibrated at 
room temperature with -50-10 ml of (Z + 0.1 M 
KC1 + 1 mg/ml BSA + l mM DTT) 
-immediately prior to loading the column, the 

total volume of the HMK reaction was brought to 
100 ,il with ice-cold (Z + 0.1 M KC1 + 1 mM DTT) 
-the column was loaded and run at room temperature, 

taking l drop fractions (-45 ^1/drop) 
-fractions were stored immediately on ice 
-2-5 pi aliquots were removed from each fraction and 

counted by Cerenkov counting 
-the excluded peak fractions (usually at 1/3 to 1/2 
the column volume) were pooled 

Optionally aliquots of the fractions (e.g., l.o ^1) 
15 can be fractionated on SDS-polyacrylamide gels to 
monitor the progress of the fusion protein. The 
above procedure can also be carried out at 4 0 c where 
desired. . 

Equivalents 



20 



25 



Those skilled in the art. will be' able to 
recognize, or be able to ascertain, using no more 
than routine experimentation, many equivalents to the 
specific embodiments of the invention described 
herein. Such equivalents are intended to be 
encompassed by the following claims. 
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CLAIMS 



An expression vector comprising a nucleotide 
sequence which encodes at least one affinity 
ligand, at least one modification recognition 
sequence and at least one restriction site for 
the in frame insertion of a nucleotide sequence 
encoding a selected polypeptide. 

The expression vector of Claim i wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, and the modification recognition 
sequence is a phosphorylation site. 

The expression vector of Claim l further 
comprising a nucleotide sequence which encodes a 
cleavable linker located between the affinity 
ligand and the restriction site for insertion of 
a selected polypeptide. 

The expression vector of claim 1, wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site, Factor Xa cleavage site, 
and enterokinase cleavage site. 
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5. The expression vector of Claim 4, wherein the 
vector is pGEX-2TK. 

6. The expression vector of Claim 4, wherein the 
vector is pAR(ARI) 59/60. 

7. An expression vector comprising a nucleotide 
sequence which encodes a fusion protein 
comprising an affinity ligand, a modification 
recognition sequence and a selected polypeptide, 

6. The expression vector of Claim 7, wherein the 
10 affinity ligand is a glutathione-S-transf erase 

affinity ligand or a FLAG epitope affinity 
ligand, and the modification recognition 
sequence is a phosphorylation site. 



15 



9. The expression vector of Claim 7 wherein the 

nucleotide sequence further encodes a cleavable 
linker located between the affinity ligand and 
the selected polypeptide. 
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The expression vector of Claim 9, wherein the 
affinity ligand is glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site, factor Xa cleavage site 
and enterokinase cleavage site. 
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il. A method of producing a labeled fusion protein 
comprising: 

a) expressing a fusion protein comprising an 
affinity ligand, a phosphorylation 

05 recognition sequence and a polypeptide in a 

host cell using an expression vector; 

b) lysing the host cells to obtain a lysate 
containing the fusion protein; 

c) contacting the lysate with an affinity 
matrix under conditions sufficient to bind 
the affinity ligand portion of the fusion 
protein to the affinity matrix; 

d) washing the product of step (c) to remove 
unbound material;, and 

° e) contacting the bound fusion protein with 

[7-labeled]ATP and a kinase reaction buffer 
comprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 

~° produce a labeled fusion protein which is 

bound to the affinity matrix. 



12. 



25 



The method of claim li further comprising' the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 

g) washing the affinity matrix with 
labeled fusion protein bound thereto to 
remove unincorporated label; and 



i 

! 

I 
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h) eluting the labeled fusion protein 

with a suitable elution buffer comprising a 
release component thereby obtaining a 
labeled fusion protein. 

The method of Claim 11 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop 
buffer; 

g) washing the bound labeled' fusion protein to 
remove unincorporated label; and 

h) cleaving the labeled fusion protein with a 
site specific protease to produce a labeled 
selected polypeptide portion. 

15- 14 . A method of producing a labeled fusion protein 
comprising: 

a) expressing a fusion protein comprising a 
GST affinity ligand, a phosphorylation 
recognition sequence and a polypeptide in a 

20 host cell using an * expression vector; 

b) lysing the host cells to obtain a 
lysate containing the fusion protein; 

c) contacting the lysate with immobilized 
glutathione under conditions sufficient to 

25 bind the GST portion of the fusion protein 

to bind to glutathione, thereby obtaining 
immobilized glutathione with the bound 
fusion protein; 
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d) 



washing the immobilized glutathione with 
bound fusion protein to remove unbound 
material; and 
e) contacting the bound fusion protein with 

[7 -labeled] ATP and a kinase reaction buffer 
comprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 
produce immobilized glutathione with " 
labeled fusion protein which is bound to 
immobilized glutathione. 

The method of claim 14 wherein the protein 
kinase is the catalytic subunit of a 
cAMP-dependent protein kinase. 

16. The method of Claim 14 wherein the [7 -labeled] ATP 
is [7- P]ATP. 



The method of claim 14 further comprising the 
steps of: 

f) optionally stopping the reaction of step 

(e) by addition of a stop buffer; 
9) washing the immobilized glutathione with 
bound labeled fusion protein to remove 
unincorporated label; and 
h) eluting the bound labeled fusion 

protein with a solution comprising reduced 
glutathione thereby obtaining a labeled 
fusion protein. 
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18. The method of Claim 14 further comprising the 
steps of: 

f ) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 
05 g) washing the immobilized glutathione with 

bound labeled fusion protein to remove 
unincorporated label; and 
h) cleaving the labeled fusion protein with 
thrombin to produce a labeled selected 
10 polypeptide portion. 

19. A kit for preparing a radiolabeled 
glutathione-S-transf erase (GST) fusion protein 
comprising: 

a) an immobilized glutathione affinity matrix; 
15 b) a wash buffer which permits binding of a 

GST-fusion protein to the affinity matrix; 

c) a kinase reaction buffer; 

d) a protein kinase preparation; 

e) an elution buffer comprising reduced 
glutathione for releasing the fusion 
protein from the affinity matrix; and 
optionally 

f) a reaction stop buffer. 
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20. A labeled fusion protein comprising an affinity 
ligand, a modification recognition sequence, and 
a polypeptide. 
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21. The labeled fusion protein of claim 20, wherein 
the affinity ligand is a 

glutathione-S-transferase affinity li gan d and 
the modification recognition sequence is a 
phosphorylation recognition sequence. 



22 



10 



23 



The labeled fusion protein of Claim 20, wherein 
the affinity li gand is the FLAG epitope> ^ 

modification recognition sequence is a 
phosphorylation recognition sequence. 

A host cell containing the expression vector of 
Claim l. 
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Enterokinose 
I 



N-Met- 



FLAG 



HMK 



-Glu-Phe-Prote 



in 



ci Ar Enter okinose 
FLAG peptide 



+ 50 

Met Asp Tyr Lys Asp Asp Asp Asp Lys ' A/a 
• • MGGAGATATA CATATG GAC TAC AAA GAC GAT GAC GAT AAA GCA 

Ndel 

HMK recognition 



Arg Arg Ala Ser Vol Glu Phe 
AGA AGA GCA TCT GTG GAA TTC CATATG 
^ EcoRl Ndel 

~~ " "V — " 
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